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TATAAAGGAG CTGCTACAGA ATTATTCAAT TATCTTATTC ATGTAGTTAA AGCACGTGGT ' 84 0 

GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TrTATTTGCC 900 
AAATTTGGAT TTCATAAGG 

(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561: 

ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTOTT ACTGGACCTA AAAATITAAA 60 

CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 120 

AGAAGCATGT TCATACTGCA AATCTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 180 

ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAGCTTGAT TAACAATTGC 24 0 

TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCGGT CGATATCTTG 300 

AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 360 

TTTCTTTTTT AAAATAGTTA ACCAAGATAG CCCAGCATGT TGTGaTTCTA ATGCTAAAAG 420 

TTTAAACAAT GCCTTGCnAT CATAGAGCGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 480 
AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 
(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



518 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 



so 



CTTATTTAAA 


AGTAAATCAA 


TCAATGTATT 


ATAATCCGAA 


TAGTCCGCAT 


AAAGCTGGTT 


60 


TGCaGGCAAA 


tCAATTACTA 


CAACAAGCAA 


AAACCCAAAT 


TAATGCAATG 


rTTaATTCAA 


120 


AAACAAATTA 


TGATGTTGTA 


TTCACTAGTG 


GTGCAcTGAA 


TCCAATAATC 


TTGCTTTAAA 


180 


AGGTATTGCC 


TATCGTAAAT 


TTGATACAGC 


GAAGGAAATA 


ATTACATCCG 


TGTTAGAGCA 


240 


TCCGTCCGTA 


TTAGAGGTTG 


TAAGATATTT 


GGAAGCACAC 


GAAGGATTTA AAGTTAAATA 


300 



1500 
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CTGATTATAA AAATAATGAC TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 24 0 

CTGAAATGAA TATTACX3GTA GAGCAGTTGG TAAACTTTGG ACGATTCCCT TATTCTAAAG 3 00 

GTCGTTTGAC GAAAGAGGAT CATGATATTG TCAATGATGC GCTAGATTTG TTGCAACTAC 360 

AAGATATCAG AAATCGTAAT ATTAAGTCAT TATCTGGTGG ACAACGTCAG CGTGCATACA 420 

TTGCAATGAC AATAGCACAA GATACTGAAT ATATTTTGCT AGATGAACCA TTAAATAATT 4 80 

TAGATATGAA GCATGCTGTT CAAATTATGC AAACGTTAAA AATGTTAGCG CATAAAATGA 54 0 

ATAAAGCGAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAGA 600 

TTGTAGCATT GAAAAACGGA CAACTAGTTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 660 

GTAGTGTTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 720 

GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 780 

20 ATTAAAGAAG GGTTAAACGT AAAGCATTTA TTTAT B15 

<2) INFORMATION FOR SEQ ID NO: 560: 
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(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 

GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 60 

CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 120 

TTATGTTnAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 180 

TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAACG 24 0 

40 CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 3 00 

AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 360 

TATTTTTATC CCATAGTGTG ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 420 

GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 480 

GTGACGTACA TGATCCGTCT AGGTAA^lATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 540 

GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 600 

CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACGTATT AGAGGAAAAT 660 

GACAAAATTT ATGGCTTTAT TGTTGTCGAC CAAGACCAAG CAGAATGGTA yGnyGACATT 720 
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GAAGGTTAAA ATTTTTAAAG TAAACTTTCT TTATCCCCAT GCTACGAGTT CAGATTCAGG ' 2580 

AAATAGCCCT TTACTAGTAT TTATGTATAT TCTGTCTATG GCATGAATAA AATAATTATC 2640 

TCTTGTATTT tTTTCTAAAC TAGATTTTTC AGCATTGATA ACTTCAAGAC TATCTATATC 2700 

CATTTGAATA ATACCAGGCT TAATATTTTC ATCATTATTA GGAAAATATT TATATGTAAC 2760 

ACTTTTATCA TTAATTTCTT TTATTTTTAA TATTAGCAAT CATTTCCACC TCTAATTAAT 2820 

TAAAATACTA TAATTATATT TTATTTCTGT AAGTTTATGT GCCTCTATAT AGTGTAAATT 28 80 

ATACTTATTC ATTAGATAGT GTTCAAGAGC TTCATGTTTC TACATTATTA TATCCATTTT 2 94 0 

TTTAATATTT TTCCCTTCTC TTAAACGTTG CCAACTTTGA GCCATATAAA AGTCAGGATC 3000 

AAATTGTTTA AATCCACTTT CTAATAAATA CTTATTTTCA AATATATGTT CATAAACTCT 3060 

TTGAATTAAA TTTTTATTTA TATTAgTATT TTTAGCAATT TTAGAAATCT CTATCTGTTT 3120 

20 ATCTCGATTT CTAACTGAAT TATAATAAAT TTGAGCATGT CTGTTCCTTT TGATACCGTA 3180 

TTCATCACTT TTATTATTAA GTGCACCTGA TTCAATAAAA CT^CCTTCTA CTTGATATTT 324 0 

ACCTTCACGC TCTTTAAAGA ACTTGTCTCG CCAATTGCCG ACATGTGGCA CTGTGGTACT 3300 

TCTACACCAA GGATGCATAG GTGGCGCATT CACACCTGGT ATCATATCTT TAACTTTAAA 3360 

TATTTTTCCG TTGAGTGAAT GACATAATTT AGATGTTTTA CTATCTATTT TGGCAACATA 342 0 

TTTATATTCG CCATCTTCCA CCAAGTTCyT TTAAATATGT TAACTTTTGT GACTCtTCyT 34 80 

TTTCAACGAA TAATGAAAAA AGCATATAAA AATAGCATCG CTTTCTCTTT ATCTGTGTAG 354 0 

AAAGGATGCT TTAATACCaT GCTATTTTAT AATTTTcGGG AAATTcTTGC TTCTCGATAA 3600 

AGTCTCTTAC TACAGAAAAA GACTTATTAC GATATAACAT AAAATATTCT TCATTTTCTA 3660 

TTTCAGATAA TGAGAAATCT ATAACCTCTG CATCTTTTTT ATTAAAAGTT ACTGAACCTT 3720 

TACCGTTACT TATATCATCT TCAGGTATAT ATTC 3754 

40 (2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 
(B> TYPE: nucleic acid 
45 <C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2S 



30 



so 



ss 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 
ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA TATTaGCATA GATATCGAAA 60 
AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 120 
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AAACTGGAAT 


ATC-AATAAAT 


TCi"n"i"i"rAG 


TCATCTTTTC 


ACTTTCATTA 


<j AA X ^ 1 A 1 1 A 


"7 ft O 




TAGTATAACT 


TGGTAATAAA 


GATGAAl l AT 


TTTTCTTATT 


AX X AX AX XLlA 


X X X vrXo^k-Ai./^ 


o ^ u 


5 


AAATATGACG 


C i~l"l'l\iCTTA 


ACATTATTCA 


ATTTt-ATt-(j i 




'PT ft /""in* ft T ft 
X XA^v^X rl 


7 w V 




TTTCAACaCAl 


TTCTTTTTTA 


GCTTTTTGTA 


GAAAGvjA i vjL. 


X AX X X XuXAA 




960 


10 


AATCTAATAC 


X l l' l lATCTC 


AGl 1 1 CAT CA 


AACljAAi AAA 


L.Aul..i>v\AA X v_ 


AVj X \^ X\> X X 


X V ^ v./ 


TCTACAACAA 


GATTTCCTGT 


TT CG CTCT CG 


AATTCATTTT 


L-A X Ao X L-ALj X 


Tft r* ft ft ft ppr^ 

X AL«AAMv_V^v. i. 


X u o u 




TCGTATTCTT 


CATTATTCAA 


AAG CGTGATA 


IV f TV TV ^ IV rp/ wi'MT» 

ATAACZA rc J. J. 


T A rv^ ft 'PA TV^O 
X ACVxA X A X V, 


AX V* XTIV-AAX 1 


X X^ U 


IS 


CTCAATACTT 


TTCACCTACT 


TATCAATATA 


TV ^/"^f TV /""T* A T" A 

AGGTAu 1 AlA 




/*i'T»/"*np*"p/'YI ft ftT* 
IjX^X XV-v>AAX 


T Odd 
X Z U U 




AATGGCACTT 


cCCTTTCTGG 


TTTCAATATA 


CACATTTTCG 


AxAX\jiXAX X X 


*nr»r*r^ft ftTft ftT 

X X^\k.AAXAAX 


x^ ou 




TTGATTAAAA 


TTAATAATCT 


CTTTCAAATC 


4k * TV *^/'V« /'**T' TV 

AAATCGCTCA 


TCACTT AA 1 A 


X XAljAX XVj^U. 


1 ^ o n 


20 


TGTTGACATT 


TTTTCTCTTA 


ACAATTCATT 


CAATAAATCT 


A*PA/^ A A IV /^♦P A 

AT ACxAAAG T A 


X X\» XAX AIjU X 


1 "5 p n 




AGGCAATTTC 


TTATTATTTA 


AAATGGCTCT 


U"1"1"1"1''1"ATTT 


TV •fP A A *T» A 

TCATTATAT A 


ox? X oA X ViA^ ^ 






TAACATATAT 


CTATTTTATT 


TTCCGCAATT 


TATTTCTATT 


X X X AX X X XAC 


\JAX XXX XAAX 


X D u u 


2S 


CATTTCTTTC 


ATCTGGTTTT 


TAGui'i"! i tC 


CTGtAATTAT 


GCTTCTTTTA 


CTTCTACTTG 






ATATTTACCt 


TCACGCTCTT 


TAAAGAACTT 


GTCCCGCCAA 


TTGCCAACAT 


\3 XvJGt-AC X\3 X 


xo ^ u 


30 


GGTACTTCTA 


CACCAAGGAT 


GCATAGGTGG 


CGCATTCACA 


CCTGGTATCA 


TATCTTTAAC 


1680 


TTTAAATATT 


TTTCCGTTAA 


GTGAATGACA 


TAATTTAGAT 


(.jX X X XAL.XAX 


X A X X X X uuv. 






AACATATTTA 


TATTCGCCAT 


CTTCACCAAG 


TTCTTTTAAA 


XAXVsX XAAL. X 


X X X\j X xjU- XXL. 


1 ft no 

X O w w 


35 


TGCATTTTCA 


GTAAATAGTT 


A IV IV IV fV IV /^^^T* 

AAAAAAGCGT 


A A A ATA ^ 

AT AAAAA 1 ACj 


lJAL.l.AV_ X X X 


^P^^^P^P^P ft 1^ ^^^P^5. 
X X X X AJvW X Vj 

X X AX AL. XXX 


X o o w 

X 7 ^ U 




TCTAAAAAGG 


ATGCTATTTA 


TU"!"!'!"! GAAT 


TTGAATTCTT 


X X XCo\- X X X X 




CAAATTCTTC 


AACTAATTCT 


TTAAAAGATT 


A / IV JV T'T*^ 

CACTv-AAI 11- 


XXX XuUAu X X 


ft. r* A TT*rp p ivT 


19B0 




CTAATTGTGA 


ATCTAACATA 


ATTAAAATCA 


TCTCACTTTA 


XAX X XAAX\^a. 


T* ft TT'P ft T ft fT 
XAX X XAXAwX 


2D40 

A> vy ** w 




ATAAAiji"i~i"i 


TTCAATAATT 


TTTCAATATG 


GUiAiUAi lA 


ft ft T ft TT* li li 
AXvjAXAX XAn 


X n X X nrv\ 


2100 




ATATTTAGCA 


CAAAATTTAC 


TCACTATTTT 


A /^i^ A TV^ A A A 

ACCA IajAAAv 


V X A X X X V7A\- X 


TW^Tft ftTft ft ft 


2160 


45 


TTTTACTTGT 


CCCTTATTAG 


TAACGATTGT 


CATTGATTTT 


A X X LjA 1 (jUA X 


1 w 1 u 1 . TV A A A A A 

ijv^ X X AAAAAA 


^ 4C A V 




TGTAAATAAA 


TCATATTCTG 


AAAATCCTGA 


CTGTCCAGGA 


TGGTTATGTA 


ACATAACAAT 


2280 


SO 


TGAATTCGGT 


TTACTGTTAA 


ATAATAATTC 


GGTTGCTTGT 


TACCCTGGCA 


CAAAAGATAC 


2340 


ACTATCTTGA 


TTGACATATA 


CTTTTGTAAA 


TTTACCATCT 


TTTAACAAAT 


AAGCTACTTC 


2400 




ATTGCTATCA 


TTGTTTTCCA 


TGAGATATAC 


CTCCGTTTAT 


AGtCTGTCGA 


CTGATATTCC 


2460 
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AAGGGAAAAT AAGTGTTAAG TTTTAAATGA TAAAAAAGAT TGGAATGGAT CGTCTTGAAA 420 

TGCTCCCTTC AAAGTTTTCA TTTTTTCAAT GTCGACTTCG AAGGGGGCAT TTTCATTAAA 480 

TTGTTATAGC TTTTTATATT TGTATAATGA ACATATAAGT TTAAGAAGGT GCGAGTGAAG 540 

GAAATAAAAA AGCTCAAATG TACCAAATTG TTAATCTTAA TAAATCTCTA CTTTATAAAG 600 

ATTGAATGGA CATTCGAGCG TTAATCAGTC AGGAGGGACT TTCCCTCCTA CAATTTAATA €60 

ATAATACTTG CTTCACCACT ATACAAGGAG TGAGTTGTTA TGTTCAAAGT GAATTATTCG 720 

ATTTTAAGTT ATTATCCAGA ATATAATATC GCAGTAAGTT GGCAACGTTT AAGAGAAGGA 7 80 

,5 AAAACAATAA AAAACAAGAT TTAATACTGC TGCGTCATGA GGCGCTTGAA CATTATTTGA 84 0 

TGAATAAGTA TAATTTCAAC TATGATTATG CACATAAAAT TGTATCAAAA AAATACGATT 900 

ATTCAATTTT TATAAAAaAG AAGGTGGATT AAATGCTTAC ATTAATAAAA TTGGAAAGAA 960 

GATGaACAGG tTATAATATA TGraTATATa CCTGaAGATG aTATA_AGTAC CGGGTAAAGG 1020 

GTCCCGTACC TTTTTAATTA AAAAAGTTCC AGGGGGT 1057 
(2) INFORMATION FOR SEQ ID NO: 558: 



2$ 



30 



45 



SO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 

3S CTGTGCTGTA TTTACtTTAA TTTGACGACA TTTGAGAAGT ATTATGATGG nTGTAGTTAT 60 

TTTATGAAAG TAATGTATTA ACAATCGAAT TACTAAGTCT ATATTGGTAG GAATATCGAT 120 

GTTTAGTTTA AATGGAATGC ACTATTTAAG TTTTTAATAT GGAGATGTTT GTGACTTTTC 180 

GATGATTAAG ATTTTTATAG GTGTGCATCA TTTCAAATAA ACTTTGTGTT TAAAATTGAG 24 0 

CTTAGGAAAT CGATAGGTTT AGATGAGGAT ATTGTTGAAG TTATGTGTCT TGTATCCTTA 300 

GTTGTTATAA AAGCGCAAAA AATAGCACCG CTTTCTCTTT ATCTGTGTAG AAAGGATGCT 360 

ATTATTGTAA AACAATAGTT TTAATTTnAT TTTCTGATaT ATCATATGTr ATTcTACCTG 420 

TATCAATTTC TATCGAATTA TAACCATCAA AATTATCAAC TTTATCATCA AAGTCTATCA 480 

CTTTTCCAAT TAATATTTTA TTATTAGTAA GCGTTAATTT GACTAATTTG CCTATGTATG 54 0 

ACTGTAAATT CATATTTAAT CACTCCTTTT TAATATACGG AACTACATGA AAACCAGTTT 600 

TAGAATAATG AACCTTACCT AATTTCGTTT CAATATATTT ACCATTCACA TAAGATTTAC 660 
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AGAACAGCAA GGATTACTTA CTGAGGAATT AAAGAAGOAT ATTTTAAAAC AGAACAAATT 60 

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAG AAAAAGACAA GGGCAACTGA 120 

GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGGCACGTA AACATGAAGT 180 

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 240 

TATCAAAGGT GCACAAGATA TTATTGCGGA ACAAATTTCA GATAATCCTA AATATAGAAC 300 

AAAAATTTTA AAAGATATGT ATCATCAAGG TGTGTTAACT AGATCTAAAA AGAAAAATGC 360 

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 4 20 

TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 4 80 

GTTTGAATTC GATACGACAT CAGTAGAGGA TTTCATTGCA CGTCAAGAAA TCAATCATAA 540 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

20 CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATGCAAT 660 

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTGCAA CCTCCAATGA AAGGTAAACA 720 

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCCATT 780 

CGGTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 840 

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AGCGTATGAT GTGCAATTAA TTGCAATTGG 900 

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 960 

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCGCT TCAGTATACT CAGCATCAGA 1020 

AATTGCTAGA GATGAATTTC CTGATTTTCA AGTG 1054 
3^ (2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
^0 <D) TOPOLOGY: linear 
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45 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 

AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 60 

AAATTTCTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 12 0 

GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 18 0 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 240 

TAAAAATCAT ATGGTGAATG GATATTATAA TAATTAAAAT ACAAAAATAG TAGATTCCAA 300 
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TTACCTCTAG 


GCATGTCCCT 


TTCACGGTTT 


GCTTATGATA 


ACGTTATCGA 


C7VTTGTTAAA 


120 




TTGTCCm-l 


TGAAAAAATA 


ATTCTGGTGA 


GATAGACGCT 


ACTTGGaTTT 


CaTCaGTTTG 


180 


5 


TAAtAACGCA 


GtATAAtTAC 


CaTTACTAAA 


tTGaGTTAAT 


CGTTCaTATA 


AAGTACTAAT 


240 




AGGATAATAA 


ATGTTATCCG 


TTAAgCGCGC CGTATAGTTC ACTTGATACG 


TTTCGCCTTC 


300 


W 


AACAATTGCT 


TGTTGGACAC 


GTTTAATATT 


AGTCATCATA 


ACCTCAGAAG 


ATTCAACAAA 


360 




TGAAAAATGA 


TACTTTGATA 


CATAAGAAGT 


TTGATGTTCA 


TATGTTGAAT 


TTATGCTTTC 


420 




CGCTTTTTCA 


AAACTATAAG 


CTGCTGCATA 


AATATCATCT 


TTAGPTA ATr; 


AATGTGTACA 


480 


IS 


CATAGCATGA 


TTAAAATAPT 


TTGCCGCTTC 


GTAACTTAAA 


TATAAPr^AP A 


CATATCTACC 


540 




TTGTCGTkGt 


G CGCTTGTfJC 


AAAGTGTATC 


ACTTCTCTCCA 


PATPAf^ppa a 


CTTAGTAGCA 


600 




ACATACTTCT 


TTATAAATC P 


CTTTAATTGA 


ATATGGTATT 


P PTT* A T A T'T 


ATTTTCAGTT 


660 


20 


AAATAGTAG C 


GATAATTATA 


TTCTATTCTC 


ACAGTAATCA 


PPTAPPTTPn 


ATAAAAATAA 


720 




TTCAACTTGT 


CG AT A ACC^CZT 


ATTCACTCAA 


AATAGATTCA 


\JX3n X Vj>\ X n. X J. 


GCACACCAAA 


780 


2S 


AACCGGAAAT 


CTAATATfi PT 


CAAATGCCAT 


AATAATCGCT 


T PATPf^TTTT* 
X v_n X X X X X 


TTGCTGTAAT 


840 


CTTTAAGCAA 


TTTWSAAAAr: 


TCGCTCCGTC 


AGCAATTAAT 


GAATTS AT A A P 


GCATTACATT 


900 




GAAATTTTGA 


GG CAGT C PTT 


GAAAAATACC 


TTCATTGGTA 


TGGCGTAA PT 

vxvj^vj x^%,n^ X 


GTGTAGTATG 


960 


30 


TCCGTGTACA 


GG ATG AT AC; C 


CGTGAATGAT 


ATTTCCACCA 


AAATAAG A PA 


CGATACATTG 


1020 




AAATCCTAAA 


CATACACCTA 


GTATAGGTAC 


ACGCTGATAA 


*AirWX X U X X 1 r\ 


ACACTTCATT 


1080 




CAAGATAGGA 


TAATCATCCn 


GaTTACCCGG 


CCCAGGCGAA 


ATAACAATTG 


CTTTTGGCTT 


1140 


35 


CATATTAATG 


ACGTCTTCTA 


TCAGCAGATT 


ATCAATACCA 


ACAACTTGAA 


CTGTTAGTTT 


1200 




CGTTTGAGTC 


TTAATATAGT 


CTATTAAATT 


ATATGTAAAT 


GaATCATTAT 


TATCTATGAC 


1260 




TAGAATCATT 


fiT AT A prrrr* 


TTCTAAATGT 


GTTTTATTTT 


TATAATATGT 


ATTGGATGTA 


1320 


40 


GCTAAAACTT 


TAAAAGCATT 


GTCATTATCC 


TGACCTTGAT 


TTTAACTAAT 


ATATGGTATA 


1380 




TTCTATTCAT 


CGTACATAAA 


TGAATATCAG 


AGGTTCCTAG 


CTGAAACCCT 


CTATAAAAAA 


1440 


45 


CTAGGCCATT 


GAAATTTCAA 


ACATTCGTTG 


GG 






1472 


(2) INFORMATION FOR SEQ ID NO: 556: 









(i) SEQUENCE CHTU^CTERISTICS; 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TTTTGCTTTA TGTTAAATGG ATTATTaTTA GAATAAAAAA TCGGTGATGA GCTAAAAAAG 60 

TGTGTAGGAT GTTTTCCkAA CCCAATTTTT ACATCCGAAG ATATCGAACA ATATCTTCCT 120 

ACTTCTACAT TATTAAAATC ACTACCAAAT CCAATATAAC TGTATTCACC AATGTGAGAA 180 

TTCCTGATTT TACACCATCT ATCTATATAG TTATTGCCAT CAAATTTTGA GTTTGTAATA 240 

TACGCCAAGC GATGAATCTT AACATTCGAT TCTTTAGAGG ACTGGTTTTT CAGCAAACCA 300 

ATTAT C TTTT CAATCGCTAT CCTCATCGTC ATTTCCTCCA AGTATTTCGA TTGATAATAT 360 

CTTTATAGCT TTGAATAATT TTAACTACCT TTGTCGAAAC GTTAGTGTCT TTATAATCAA 4 20 

TAGCATCAAT CATCGGTTCG TTATTGTTTT GCATCTCTCT TGCTAGTTCA ACGGATTGGA 4 80 

TTAGATTOTT ATAGGTAATA CCACCTACAA TAACCGTACC TTTATCTAGT ACTTCCGGTC 540 

TTTCTGTGGA AGTTCGAATA AGGACACCAG GGAACTTCAA AATAGACGAC TCTTCTGACA 600 

20 ATGTTCCACT ATCTGATAGC ACAACAAATG CATCTTTTTG CAATGCATTA TAATCAAAGA 66 0 

AACCAAATGG CTTTAACTGT TTAACTAATG GATCAAATTC AAATTTACTT TCTTCAATTT 720 

TCTTCCAACT TCTTGGATGC GTTGAATAAA TCACAGGCAT TTTATACTTT TTGGCAATAT 780 

CATTTATCGC ATTCATTAAT GATTTAAAAT TCTTTTCATT ATCGATATTC TCTTCTCTAT 840 
GCXSCAGaTAC TAAAATGTAT TGTTGCGGTT CTAATCCTAG TTTATTTAAA ACGTCACTGT 900 
GATTAATTTT ATCTCGATGC GCTTCTATCA CTTCTGTCAT CGGTGATCCT GTyACAAAGA 960 

TATTCGCTTT ATTGAAGCCT TCATCTAATA AATAACGTCT GCTATGTTCC GTATAAGGTA 1020 

GATTCACATC ACTGACATGG TCAACAATTT TACGATTGAT TTCTTCAGGT ACATTCTGAT 1080 

CAAAGCATCT ATTACCCGCT TCCATGTGGA ACACAGGAAT CTTTAATCGT TTAGCAGATA 114 0 

CTGCTGCTAA ACAACTATTT GTATCACCAA GAATTAAAAG TGCATCTGGT TGTTCGCGTA 1200 

ATAAAACATC ATATGTCTTC GCAATAATAT TCCCCAtCGT TTCTCCAnGt TACTTCCAAC 1260 

40 TGCCTCTAAG TAGTGGTCCG GTTGTCTTAA TTCCAAATCA TCPJKJ^GMJ^ TTTGATTCAA 1320 

TGTATAATCA TAATTTTGAC CAGTGTGTAC TAATATCTGA TTAnAAATAT TGAT 1374 
(2) INFORMATION FOR SEQ ID NO: 555: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : 1 inear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 555: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOtXXSY: linear 
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GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTG AGAATGAAGG 
TGAAGTTACC ACACGTGTAC GCATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 
ATATTGATCT TCGTTTACAC CATATTCTTT AAGTGCTTTT GGTGTACTAC CACCACTTTG 
GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 600 
ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 660 
AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 720 
TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 780 
75 ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTGCGC AATAAATACT 84 0 

AAAAAGGGAT ACAATGCTAT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC 900 
CTTTTAGAAG TATGAAA 

(2) INFORMATION FOR SEQ ID NO: 553: 



917 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553: 

ACTCCCATAT CCACTGGCAA GATTTATAAT TTGTAATTCT AAAGTATAAT AGCCTATATT 6 0 

TTTCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 12 0 

3S AATGTATTAA ATATTTCCAT TTATACATAT ACATCTTGTA TAAAAAGAAA GAACTCCATA 180 

TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATAA AATTATTATA CTATTTTGAA 24 0 

AAACATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTATA 3 00 

AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTTATTGTAT GCCTATTGaA CTCAATGCgT 36 0 

ATATTACAAT ACCTTTTtCG CATATTCATA TAAGacTTTG CATCTTTAAG CTTAATTGCT 42 0 

ATCTCTTTCT CT 432 
(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TGCTGACATT ATTATAAATG CTTTACCAGA AACGCAAGAA ACGATTCaTT TaCTAAAGAA 3 960 

AAAACATTTT GAATTAATGA AAGATG/^GC ACTTTTTATA AATATAGGAC GAGGTAGCAT 4020 

AGTTAAAGAA GCGCTCTTAA TAGT^GTATT AAAAAGTAAA GTTATTCGAC ATGCATATTT 4080 

AGATGTGTTT GAAAATGAAC CTTTGAAACC TAATCATGAA TTATATGAAT TGGATAATGT 4140 

AACTATAACA GCGCATATAA CTGGTAATGA TTATGAAGCA AAGTATGACT TATTAGATAT 4200 

TTTTAAAAAC AATCTAGTTA ATTTTCTCAA TAAGAATGGT CTAATTGAGA ATGAAGTTGa 4260 

TGCTAAAAAA GGCTATTAAA TGArATCATC ATGTAAATAT TGACACGCGC GCAATACTAC 4320 

AGTTATATTT aTAGTAAgTt AATaATgATT ATATAAGAaA GATGGTgATA TAGATGAGTG 4380 

TTGAAATAGA ATCAATTGAA CATGAACTAG AAGAATCAAT TGCATCATTG CXSACAAGcAG 4440 

GCGTAAGAAT TACACCTCAA AGACAAGCAA TATTACGTTa TTTaATTTCT TCACATACTs 4500 

20 ATCCAACAsC TGaTGaAATT TATCAAGCAC TTTCACCTGa TTTTCCAAAT ATAAGTGTTG 4560 

CGACAATATA TAATAACTTA AGAGTGTTTA AAGATATTGG AATTGTAAAA GAATTAACAT 4620 

ATGGAGACTC ATCAAGTCGA TTCGACTTTa ATACACATAA TCATTATCAT ATTATATGTG 4 680 

AACAATGTGG TAAGATTGTT GATTTTCAAT ATCCACAGTT AAATGAAATT GAAAGATTAG 4740 

CTCAGCATAT GACTGACTTT GACGTAACAC ATCATCGAAT GGAAATTTAT GGAGTTTGTA 4 800 

AAGAATGCCA AGATAAATAA TTTAACTTTG GTAGTATGAC AAATTAAAAA AGCGTTACTw 4 860 

ACTTCATATA AGTAAGCGTA ATATTTAAGA nGTTAAACGA CATGaAAGTt GTTTAACTTT 4 920 

XTT 4923 
^ (2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 <D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

TCCGGCTTTA AAAACTTTTC CCAATTCCAG CTTGGGCCTT TGGCATTAAT ATTAACCTCC 60 

TGGTTCGGAT TAATTGGGAA CCTAACCnTT TAGGCAATAA TTGGTTTAGG CAACTTCCAA 12 0 

ATTGGTGGTT CAACCAACGT CTTTGGATAC CyTGcTCATT TAATTCTAAA ATGgTyrGAA 180 

CGCATTTTGG TACCCAAAAt GgTGACGTTC GTTTGCACGG TCTAATAAAT TGTCTAAGTT 240 

GTCGATTGGT TTCATTAATT GAaCACCATT TTGCTCTTCA GCAAGACCTT TGTCTACTTT 300 
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AAATTTCATT AACCTCTTCT AAAAATCCAG GTTGAGGCAT TACCATTCCA AAGTTACCAA 216C 
CAATTGGTTC TACTAATACT GCGGCAATTT CATCACCCCA AAATTCAATT GCTTCrTTAT 222C 
AGGCGTTAAT ATCATTGAAA GGTACAGTAA TGACTTCACG TGCGACGCTT TCTGGAACAC 2280 
CAGCTGAGTC TGGAGAACCG AGCTGAGATG GGCCGCTACC TGCTGCAACC AATACTAAAT 2340 
CAGAATGGCC ATGATAAGAT CCAGCAAATT TTATAATTTT ATTTCTTTTA GTATATCCAC 2400 
GTGCAACACX3 AATTGTTGTC ATGACTGCTT CTGTTCCAGA ATTTACAAAG CGAATTTTCT 2460 
CAAGAGATGG AATTGCATCA CGTAATTTTT TGCTGAATTC AATTTCTAAT TCAGTCGGTG 2520 
15 TACCAAATAA AACACCTTTA GCAGCTTGTT CTTGAATTGC TTTAGTAATA TGAGGATGTG 2580 
CATGCCCCGT AATAATTGGA CCGTATGCTT GAAGGTAATC AATAAATTTA TTCCCATCGA 2640 
CATCATATAA ATATGCACCG TGTCCTTCTT TCATAACAAC AGGTGCACCG CCTCCTACAG 2700 
CTTTATAAGA ACGAGAAGGG GAATTGACAC CGCCTAHAAT ATZiT>Tr-r:wr rzi^r^nr^rrrnr^^ 
GTAAACGTTC ACTTTCACTA AAATTCATTT ATATCAACCT CTTTTAATTT AATATTTTCA 
TCTAATATCG TATCATAAAA TTATTATAAT GAAGAAAAAG GTGATTATAT GTTCCAAAAA 
GGAGAACAAT TTCCAATATT TAAATTAGAA AATCAAGACG GAACTGTCAT TACAAATGAT 
ACATTAAAAG GTAAAAAGGC GATTATATAT TTTTATCCTA GAGATAATAC ACCTACTTGT 
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ACCACAGAAG CTTGTGACTT TAGAGACAAT TTAGAAATGT TCAATGATTT AGATGTTGCA 3060 
GTATATGGTA TAAGCGGTGA TTCAAAGAAA AAACACCAAA ATTTTATTGA GAAACACGGA 3120 
TTGAATTTCG ATTTATTAGT AGATGAAGAT TTTAAATTAG CTAAAGAAAt GGCGTATATC 3180 
35 AGTTAAAAAA ATCATTTGGC AAAGAAAGTA TGGGCATTGT AAGAACGACT TTTATAATAG 3240 
ATGAACAAGG TAAAGTATTA GATGTTATCG AGAAGGTTAA GGTTAAAACA CAAATAGAAG 3300 
AACTTAAAAA CATTTTGGGG TGACATATAT GAAAGTTGTT GGGTTAAATC GTATGCGTCA 3360 
AGTTGAAACT GAATTACAAC AACGCTTTTC AGATTTAGAT TTTAAATTTT ATAAAAAAGC 3420 
ATCAGAAATA CCTGAGAGCG ACTTGGCTGA TTTAGATATA TTAGTrGGTT ATGATGGCGG 3480 
TATCAATGAG GCATTTTTAC GACGTTGCCC GAATTTAAAA TGGATTGCAT GGTTTGCAAC 3540 
GGGTGTAAAT ACATTGCCGT TAGATTATAT TGCAGATCAC GGCATACTTT TAACTAATGG 3600 
AAAAGGTGTT CAAGCTAAAC AATTATCTGA ATACATTTTA GCTTTCATTT TAGATGATTA 3660 
5P TAAAAAGATG AAACTATCAT ATGATAACCA ACGACAACAT ATATATGATT CGAAAATAAC 3720 

TGGTAAACGC CTATCAGGAC AAACAGTTTT ATTTTTAGGT ACAGGTGCAA TTGCTACTAG 3780 
AACTGCGAAG TTAGCAAAGG CTTTTAATAT GAATTTAATT GGTCTGAGCA AGTCAGGTCA 3 840 
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ATGTGGACAA TAAATCTTTT TTCTGCGGAT TATCTAACTC ATAATCAACA TGTGTCACAT 360 

TATAACGTGC TTTTTTAGAA AGGCTAGCTA AAATTTGCTC GTGAAAAGCA GTTAATGAAT 420 

CTAAATCTAA TTTGATTTGT AATAGGAAAT TGTTATTAAG TAAATATAAG TCGTTTTGAT 4 80 

AACGCGACAA TTTCTTTAAT ACTTCATAAG CTTGTCTAGT CGTCTGAACT ACTTCTCTGA 540 

AAAGTATTTT CTTTCTATTC TGTTGGTGAA TATGTTTTTT TGTAATAGGA CGTTCTTCX3C 600 

TATAGTAATC ATAAATTTTC TCTAACTTTT CXa^CACGTTG TTTTAAATTA TGACTATCTT 660 

GTTTAATATT ATTAAACTCC GTCGTATCAT TTAATACTAA TTTAAACCAC ATAAAAATAT 720 

CTCAGGATAT ATTTAATGAA TTATAGTAAA TTTTTGTTTC AAATTTAGGT GGTAGAAACA 780 

CAAAGTTAAC TAGAGATGAA CTTATGACAC CAATCATTAC AAGTACAAAC CTGTAAAAGG 840 

CGGTAATATA GAAAGAACCG GTATGTTGTC CCATAATGAT TAATGCTGTT ACACTCGCCA 900 

AAGTAGCAAC ATCTGCTAAA TTAAATTTAA ATAAAATAGC AATAAGTACT ATGACGGTAA 960 

CACCCATAAT GATAAAATTA TCACTAAAAA TTGTTACCAT TGTAACAGAT AGTATGGCAC 1020 

CTATAATGTT ACCCAATGCT TGATCAGAAA CTGTTTTAAA TGAACGATAA ATACTAGGTT 1080 

GCATTGCACA ACAGCACTGA CACCAGCTAA GGCTTTCAGA CCAACATCAT CCGGTAGTAA 114 0 

AGAAGCGATA GACATAGCTA AAATAATGGC TATACCAGTT TTAAAAATCC GAGCTCCTAG 1200 

TCTCAAAAAT AATGACGCCC CTTTTAAGTT TATTGAATAT CTAATATTCG TATTCATTAC 1260 

TGTTATACAC TTACTAGTTA CAAAATTCAA GCTTATTTAT AGTTGTTAAA ATAAATCATA 1320 

CATAATACTG ATAGCGATGT AAAACTTTAG TCAGAGATTA AAATAGTATA AATTTGTAAA 13 80 

ATAAAAACTC ACATAGTGAC ATATCAAGTT AAACGTTAAT AGTTAACGAT ATAAAATGAA 1440 

TCTACTATGT GAGCATTTGC TTTATTTTAA TTCAATTAAA AATATACTTC CTTAAAAGTT 1500 

ATTTCATTTG ACTAAAAGCA TAGTCTCCAG CTTTTAAAGT TTGTTTAATA TCTTCTTCTG 1560 

40 TATGTTCAGT TGTTAAGAAC CAAGCTTCAA ACTTAGAAGG TGCTAAATTG ATACCTTGAT 1620 

TTAACATTAA TTTGAAAAAT TTACCGAACG CTTCGCCGTC AGAATGTTCA ACTTGATCAT 1680 

AATGTGTGAC TTTTTCATCT GTAAAGTACA ATGTTAAAGA TCCATAAATA CGATTAATTG 1740 

TAGCTGTGAT ATTATGTTTT TCGATTAATT TAAGTAAACC TTCTTCTAGT TGTTGGCCTA 1800 

AGCTGTCTAA TTTITCATAA ACACCGTCTT GTTCTAGTAC TTCGAGTAAT GCAATACCTG I860 

CTTTCATAGA TAACGGGTTA CCAGCCATTG TACCAGCTTG ATATGCAGGT CCTAGAGGTG 1920 

CTACTTGTTC CATAATATCT TGACGTCCAC CATAGCCTCC AATTGGTAAA CCACCGCCAA 1980 

CAATTTTACC AAATGCAGTT AAATCAGGGA TAACACCTAA TAAATCTTGA GCGGCACCGT 2040 
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TTTGTTTTAA TGCATCAATG AGTGCTGTTT GATTTTCAAC AATTGGACCT GGCAACTCTT 
TTTTATAATC CATGTAAAAA CCTCTAAGCT CATCGCCATA TTTATCTAAG TCATATGCAT 
AGAAAATTTG CGGACGCTTT AATACACCGA AGTCGAACyVT GACAGATGAG TAGTCGGTAA 
CTAACGCATC GCTGATTAAG TATAAATCCX3 AAATGTCTTC ATAAtCTGAA ACGTCTTrCA 
CAAAATCATC ATGTTCATCA ATACGTGTCA CAACTAAATA ATGCATGCGT AAkAAAATAA 
CATArTCATC ATCCAGCGCT TGACGCAAAG CTTCTATATC AAAGTTAACA TTAAATTGAT 
ATGAACCTTC TCGAATCGCT TCATCGTCAC GCCAAGTTGG CGCGTACATA ATCACITTTT 
TATCTAATGG AATATTTAAT CTTGTCTTAA TACCATTAAT ATATTCAGTA TCATTGCGTT 
TATGTGATAA TTTATCATTT CTTGGATAAC CTGTTTCCAA AATCTTATCT CGACTAACAT 
GAAATGCATT TTGAAATATC GATGTCGAAT ATCGATTAGG TCACACTAGA TAATCCCACC 
GTTGGCTTTC TTTTTTAAAG CCATCTTGGT AATTTTGAGT ATTTGTTCCT AGCATTTTAA 
CGTTACTAAT ATCCAAACCA ATCTTTTTTA ATGGCGTGCC ATCCCATGTT TGTAAGTACG 
TCGTTCGCGG TGATTTATAT AACCAATCTG GTGTACGTGT GTTAATCATC CwCGCTTTCG 
CTCTTGGCAT CGCTAAAAAC CATTTCATTG AAAACTTTGT AACATATGGT ACATTGTGCT 
GTTGGAATAT GTGTTCATAT CCTmTTCA CACCCCATAT TAAirGGGCA TCGCTATGTT 
CAGTTAAGTA TTCATATAAT GCTTTGGGGT TGTCGCTGTA TTGTTTACCA TGAAAGCTTT 
CAAAATAAAT TAGATTCTTG TTTGGCAATT TTGATAGTAA TTTAAAAGTC GTATATATAC 
TATGTTCTAT CAATTTTTTA ATTGTATTTT TAATCATGTC GTACCTCCGA CGTGTTTTTG 
35 TAATTATATT AATATGTATG AGC 

(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 
40 <A) LENGTH: 4 923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 
CAGTAAGAGA TTTTCTTAAT TGAAAATAAT CTTACTGCTT nTTAAATTT AATTTCGAGA 
so TTCAATATTA GTTTATCTCA TTGTGGCATT AATTCATAAA ATTGTTTTAA TCTTATAAAT 
CAAAGTCtTC TTCAACAATT TCAATGTCTG CATCAGATCT ATGATATGTG PAPAI^GCTtA 
TTCTTATGCG GTCTAAATGC TCTAAATGGT GTCGATACTC TTCGATGGCT GCAACAATTT 
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TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTG 720 

CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 780 

TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 84 0 

ATTCACGTGT CGCACtAGTA ATTACCAnAG T 871 
(2) INFORMATION FOR SEQ ID NO: 54 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 9: 

TCAATCTAAT ATATACTTCA TGACTTCCCG CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 60 

TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 120 

ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 180 

CCATTGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT 240 

TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 300 

CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTtGTCC TTGTTCCTCA aAAwAGCTTT 360 

CAATATATTG AAAGAAGGCT CATCAATAAC TAAATCTnTA 400 
(2) INFORMATION FOR SEQ ID NO: 550: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 

ATACCTCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 60 

ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 120 

CTCTAAATGT CTGTGTATTA CTTAAAATTA TACATATTTT GCTTATATTT TAGCATATTT 180 

TGTTTAAACC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 240 

AAACATTICG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 300 

TAAGTTATCA CTTAAAAATC GTTTGGCaAA TTCGTTGTGA CGCTTGTCCA TCTTCTAATG 360 
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(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 
TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTT^CAGGTT TAATTTCAAA 
GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 
ATAACATAAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 
AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGGaAT TCGTTATGAT CAACTAAGAT 
AACATCTTGA CCATCTAAAT CATCTGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC 
TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 

.>..»^c^ix v-n,v.v.ai AAT AATCGCAGAT GAAATTGCAT CAGTGkCTGG 

aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 
TTCTTTA 

(2) INFORMATION FOR SEQ ID NO: 548: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 8: 
TTGGTGGTGC AGCAGTTATA GCAATTGTTT TAGCATTCAT TGGTAAGTTC ACTGCATTAA 
TTTCTTCTAT ACCTACmCCA GTTATGGGAG GAGTATCTAT ATTACTTTTC GGTATTATTG 
CAGCAAGTGG CTTAAGAATG TTAGTTGAAA GCAAAGTAGA TTTTGCGAAC AATCGAAATT 
TAGTTATAGC TTCTGTAATT TTAGTTGTAG GTATCGGTAA TITAGTATTT AACTTAAAAG 
AAATTGGTAT CAACCTTCAA ATTGAGGGGA TGGCATTAGC TGCACTTTCA GGAATTATTT 
TGAACTTAAT CTTACCTAAA GAGAAAAAAC AAAACAATTA AGATTTACAA ATTAAGGAGG 
GCGCTTTTAT GAATCATTTA TTATCAATGG AACATTTATC TACAGATCAA ATATACAAAC 
TTATCCAAAA GGCAAGTCAA TTTAAATCTG GTGAACGTCA ACTACCAAAC TTTGAAGGGA 
AaTATGTCGC AAATTTATTC TTTGAAAATT CTACTCGrAC AAAATGTAGT TTTGAAATGG 
CAGAACTTAA GCTAGGGTTA AAAACGATTA GCTTTGAAAC ATCAACATCA TCTGTTTCAA 
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GATTGATTTA GATCAAGTCC AACAAGATTT GAAAAATATC GATAAAGAAA TCGCAGAAAT 
TGAGCAAGAA ATCAATGCAT ACCTGAAAGA ACTTGGGGTG TTGAAAGATG AGTAATACAC 
AAAAGAAAAA TGTGCCAGAA TTGAGGTTCC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 
AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGC 
CTTTAACAAT ATCCXKSACAG TTAGGTTTAA TTGATCAAAC AGAATATTTT AGTAAATCAG 
TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGGAGAATTC GCGTATAACA 
AAAGTTATTC TAATCGATAC CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 
GTGTATTCTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAGACTTCA 
TGGAAGCATA TTTTGATTCG ACACACTGGT ATAGAGAAGT TTCTGGAATT GCAGTTGAGG 
GTGCAAGAAA TCACGGATTA TTAAATGTTT CTGTGAATGA TTTTTTTACT ATTCTAATTA 
AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 
AAATTGAATT AGAAGAACAA AAGCTTGAAT TACTTCAACA ACAGAAAAAA GGCTATATGC 
AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAAGAT TATCCAGATT 
GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 
AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 
ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTGCATATA 
ATTCTATGAG AATGTGGCAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 
CTGCATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 
TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 
CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 
AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 
TGAAAATTGA AATATTAGAA AAAGAGAAAC AATCCTTTTT ACAAAAAATG TTCTTATAAC 
TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT AAAAGTTGTA 
AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 
TTTGTGTCGA AATTGTGTAC AGAATAAGTA GTTAAATAAA GATTAAGTTG AGATAAAGTG 
TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 
TTAAATTACT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 
TTGCTCATTA TAATGA 

(2) INFORMATION FOR SEQ ID NO: 547: 
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ATAACCTTGA TTTAAGATCT AAATTAAAAT TTAAATATAT GGGGGAAGTC ATAGAAAGCA ' 96 

J^CMATTPJ^ AGATATTGAA GTTAACTTAA AGTAAATCAT TACGAATAAT TAAAAGTAAT 102 

TGAAGCGGCT TAACGGTGAA ATGTAAATTG GTGCGCATAG CTTATACAAA AAGGATGCAT I08i 

CAATCGATAT CGTCGTTAAG CCGTTTTGGT TTGTGTCTCA TGAATCCTAT CCCAATCTCC 114, 

ATAAAGGTAA AATTTCCACC ACCAACATCA AAATTCTCCA CATCGCAACA TAACCAAATG 120( 

TTATAATAAA TCTATTACAC AAAGAGATAA ATTACTTATT CAAAGGCGGA GGAATCACAT 126( 

GTCTATTACT GAAAAACAAC GTCAGCAACA AGCTGAATTA CATAAAAAAT TATGGTCGAT 132( 

IS TGCGAATGAT TTAAGAGGGA ATATGGATGC GAGTGAATTC CGTAATTACA TTTTAGGCTT 138( 

GATTTTCTAT CGCTTCTTAT CTGAAAAAGC GGAACAAGAA TATGCAGATG CCTTGTCAGG 144 ( 

TGAAGACATC ACGTATCAAG AAGCATGGGC AGACGAAGAA TACCGTGAAG ACTTAAAAGC 150C 

AGAATTAATT GACCAAGTCG CTTArTTraT Trjar-or-rvr^R^ r.R.na~,.»^«. 

- — •"^^-""•™-» i xrtlH_rt V3HjV_ljr/\l\iAT 156C 

TCGTGAAATT GAAACGCAAG ATTTCGATAT CGAACACCTG GCGACGGCAA TTCGTAAAGT 162C 

TGAAACATCA ACATTAGGTG AAGAAAGTGA AAATGACTTT ATCGGTCTGT TCAGCGATAT 1680 

GGATT-TGAGT TCAACGCGAC TAGGTAACAA TGTCAAAGAA CGTACTGCTT TAATCTCTAA 1740 

AGTCATGGTT AATCTTGACG ACTTACCATT CGTTCACAGT GACATGGAAA TTGATATGTT 1800 

3„ AGGTGATGCA TATGAATTCC TAAlTGGGCG CTTTGCGGCG ACAgCGGGTA AAAAAGCAGG 1860 

CGAGTTCTAT ACACCACAAC AAGTATCTAA GATACTCGCG AAGATTGTCA CAGACGGTAA 1920 

AGATAAATTA CGTCACGTGT ATGACCCAAC ATGTGGTTCA GGTTCACTGT TGTTACGTGT 1980 

3S TGGTAAAGAA ACACAAGTGT ATCXJTTATTT CGGTCAAGAA CGTAACAATA CTACATACAA 2040 

CTTAGCACGC ATGAATATGT TATTACATGA TGTGCGTTAT GAGAACTTCG ATATCCGTAA 2100 

TGATGACACA TTGGAAAACC CAGCCTTrTT AGGCAATACA TTTGATGCGG TTATTGCGAA 2160 

40 

CCCACCGTAT AGTGCGAAAT GGACTGCAGA TTCAAAGTTT GAAAATGACG AACGATTCAG 2220 

TGGTTACGGC AAACTTGCGC CTAAGTCTAA AGCAGACTTT GCCTTTATTC AACACATGGT 2280 

ACATTACCTA GACGATGAAG GTACCATGGC CGTTGTACTC CCACATGGTG TATTATTCCG 2340 

AGGTGCTGCA GAAGGTGTCA TTCGTCGTTA TTTAATTCAA GAAAAGAACT ACTTAGAAGC 2400 
TGTGATTGGT TTGCCAGCGA ATATTTTCTA TGGGACAAGT ATTCCAACAT GTATTTTAGT 
SO ATITAAAAAA TGTCGCCAAC AAGACGACAA CGTACTATTT ATCGATGCAT CCAATGATTT 
TGAAAAAGGA AAAAATCAAA ATCATTTAAG CGATGCCCAA GTCGAACGTA TTATAGACAC 
ATATAAGCGT AAGGAAACAA TTGATAAATA TAGCTACAGC GCGACACTAC AAGAGATTGC 
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GATCAATATA GCGGTAAAAA GCTAGGTAOG ATTAAATATG ATGACTACGG TATTATTGCT 420 

AAATGGTTTA CATGGGGCAT TCCGCTTCAC GAAGGTCATT TATTCGGCAT TTTAAATAAA 4 80 

ATCATTAATT TATTT G T A TG TATCGCTTTA TTAGTAGCCA TTGGCATGGG GTTTGTCTCT 540 

TGGATAAAGC GTACAAAAAA TACTGCAGTA AAAGTACCAC ATCGCGTAAA AAAACCAGCA 600 

TCTATATCAC TCATAATATG TTTAATTGTA TTAGGATTAT TAATGCCATT ATTTGGATTA 660 

TCACTTATCC TTGTATTTAT AATTGAATTA ATATTATATA TTAAAGATCG TCGTGCTAAA 720 

CAATAATGCA CTTAAAGTTT TGAACTGACG AAATTTACAA AATGgATTCT CGTCTCTCTA 780 

ATTACtTAAA ACGGGgTtCy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 84 0 

CrotCCGAtT TTTTnGAAwT TGAAGATAAA AAAGCATCTA AAACGC 886 
(2) INFORMATION FOR SEQ ID NO: 546: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 

GGCATTTGTG TCCTTATATA AGGAACTGTG tTAAATACAT TACTGTTGTT AAGTTGTTTT 60 

TGTAATTCAA AGAGCAGAAC AGAGTAACAT CATCAGTTGT AGTAAACGAT AATCCGGTAA 120 

AACAACTAAA TGAAATAATG AAAGTCATTT AACCTGAACA TTAAAATATA TTTGTTTTTC 180 

ATTAAGAATA ATTCAAGTAT ATTTAAATCG AGGTTAATTA TCGTATGAAA CGATGCACGT 240 

TATAATAAAA ATGTATGATT CAAATTACGT AATGAAAACA ATCCAATATA TTAAGATTGG 300 

AGCAAATAAA TATGAAATTT ACAGCATTAG CAAAAGCGAC ATTAGCTTTA GGAATTTTAA 360 

CAACAGGAAC TTTAACAACA GAAGTTCATT CAGGTCATGC AAAACAAAAT CAAAAGTCAG 420 

TAAATAAACA TGACAAGGAA GCATTATACC GATACTACAC TGGAAAGACT ATGGAAATGA 4 80 

AAAATATTAG TG CT T T GAAA CATGGTAAAA ACAACTTACG TTTTAAGTTT AGAGGTATTA 540 

AGATTCAAGT TTTACTGCCT GGAAATGATA AAAGTAAATT TCAACAGCGT AGTTATGAGG 600 

GGTTAGATGT TTTCTTTGTT CAAGAAAAAA GAGATAAGCA CGATATATTT TATACTGTTG 660 

GTGGTGTAAT ACAGAATAAT AAAACATCTG GAGTTGTCAG TGCACCAATA TTAAATATTT 720 

CAAAAGAAAA GGGTGAAGAT GCTTTTGTGA AAGGTTACCC TTATTACATT AAAAAAGAAA 780 

AAATAACACT AAAAGAACTG GATTATAAGT TGAGAAAGCA TCTAATTGAA AAATACGGAC 840 
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CGCCTTCAAC TAAAGTGACA ACACCTCCAT CAACAAACAC 
CTAAATCAGA CACACCACAA TCGCCAACCA CAAAACAAGT 
AATTTAAAGA TTTAAGAGCG TATTATACGA AACCAAGTIT 
GTATTATTTT AAAAAAATGG ACGACAATAA GATTTATGAA 
TATATAAAAT TGCTTTAGTT GGTAAAGATG ATAAAAAATA 
ATGTCGATGT ATTTGTCGTT TTAGAAGAAA ATAATTACAA 
GTGGTATCAC AAAGAGTAAT AGTAAAAAAG TTGATCACAA 
AGGAAGATAA TAAAGGTACA ATCTCTCATG ATGTTTCAGA 
AGATTTCCTT GAAAGAACTT GATTTTAAAT TGAGAAAACA 
TGTACGGTAA CGTTGGTTCA GGTAAAATTG TTATTAAAAT 
CGTTTGAATT GCACAAAAAA TTAGAAGAAA ATCGCATGGC 
ATATTGATAA CATTGAAGTG AATATAAAAT AATCATGACA 
ATCGGAAAAA CAAGAAGTTA AGTGACAACG GCCTACATGT 
GTTCQATGAT TTGAGAACCC GAATTTTCGA TGGGTCCAAA 
AATTTATCTG TAAATCCCTA TCTATCGGGT GTGAAGCACA 
CGAACATTAT AGATTCCTTA ATTTACTTAA TAATGATTCA 
AATGTGAAAG GTCAAATACG CTAACTATAA TAAAGCTGTA 
AACAAATCTA ATAATTACGA ATGGAGCATA CAACTATGAA 
(2) INFORMATION FOR SEQ ID NO: 54 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



GCCACAACCA 
ACCAACAGAA 
AGAATTTAAA 
TGTTGTCCCA 
TGGTGAAGGA 
TCTGGAAAAA 
AGCAGGAGTA 
ATTCAAGATT 
ACTTATTGAA 
GAAAAACGGT 
AGATGTCATA 
TTCTCTAAAT 
TGCTTAGCTT 
TATGACGTGG 
ACGGGATCAG 
ATGATTATTA 
TGATTCAATA 
AATGACAGCA 



ATGCAATCTA 
ATAAATCCTA 
AATGAGATTG 
GATTATTTCA 
GTACATAGGA 
TATTCTGTCG 
AGAATTACTA 
ACTAAAGAAC 
AAAAATAATC 
GGAAAGTACA 
GATGGCACTA 
AGAAGCTGTC 
CTTTTGTTAT 
AAGAGACCTG 
TTTTATTTAA 
AACATGGTTT 
GACGTAAGCG 
ATTGCGAAAG 
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4320 
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4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

522.0 

5280 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 
AGTAAAATTG CCGGTATGAT GGACACAAAC GGTGACCTTG GTCAAGGTGA ATTAGCGATT 
AATCCACCtA AATCAGATTT gAACGAATTA CCTTGGGCTA CACGTAAAAA TAAACAGCCA 
OCTTCATCCG AAAAAgGTTC AAGTGGTCAT CATGGTAATG CAGCAATGCC TCAAACCAmA 
TTAGATTATC AAATATCTAT TGATAAGGTC GTTGAACAGG CGCAAAAAGC TGGTATTAAA 
AAGCCGTTTT CAATCGTATA TCCAAGTGAT AAAAATGGTA CCTTTATTGT ATCTAATACT 
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TTAATAATAC TTCAATAATT GTTAAAAGGG GTTTAATGTG ATTATCTTAG AACGCCATCT 
ATAATGATGT TGTATGATTC AAATTACGTA AAAAGACAAT CGAATATAAT ATAGATTGGA 
GCATACAATT ATGAAAATGA GAACAATTGC TAAAACCAGT TTAGCACTAG GGCTTTTAAC 
AACAGGCGCA ATTACAGTAA CGACGCAATC GGTCAAAGCA GAAAAAATAC AATCAACTAA 
AGTTGACAAA GTACCAACGC TTAAAGCAGA GCGaTTAGCA ATGATAAACA TAACAGCAGG 
TGCAAATTCA GCGACAACAC AAGCAGCTAA CACAAGACAA GAACGCACGC CTAAACTCGA 
AAAGGCACCA AATACTAATG AGGAA7VAAAC CTCAGCTTCC AAAATAGAAA AAATATCACA 
ACCTAAACAA GAAGAGCAGA AAACGCTTAA TATATCAGCA ACGCCAGCGC CTAAACAAGA 
ACAATCACAA ACGACAACCG AATCCACAAC GCCGAAAACT AAAGTGACAA CACCTCCATC 
AACAAACACG CCACAACCAA TGCAATCTAC TAAATCAGAC ACACCACAAT CTCCAACCAT 
AAAACAAGCA CAAACAGATA TGACTCCTAA ATATGAAGAT TTAAGAGCGT ATTATACAAA 
ACCGAGTTTT GAATTTGAAA AGCAGTTTGG ATTTATGCTC AAACCATGGA CGACGGTTAG 
GTTTATGAAT GTTATTCCAA ATAGGTTCAT CTATAAAATA GCTTTAGTTG GAAAAGATGA 
GAAAAAATAT AAAGATGGAC CTTACGATAA TATCGATGTA TTTATCGTTT TAGAAGACAA 
TAAATATCAA TTGAAAAAAT ATTCTGTCX^G TGGCATCACX3 AAGACTAATA GTAAAAAAGT 
TAATCACAAA GTAGAATTAA GCATTACTAA AAAAGATAAT CAAGGTATGA TTTCACGCGA 
TGTTTCAGAA TACATGATTA CTAAGGAAGA GATTTCCTTG AAAGAGCTTG ATTTTAAATT 
GAGAAAACAA CTTATTGAAA AACATAATCT TTACGGTAAC ATGGGTTCAG GAACAATCGT 
TATTAAAATG AAAAACGGTG GGAAATATAC GTTTGAATTA CACAAAAAAC TGCAAGAGCA 
TCGTATGGCA GACGTCATAG ATGGCACTAA TATTGATAAC ATTGAAGTGA ATATAAAATA 
ATCATGACAT TCTCTAAATA GAAGCTGTCA TCGGAAAAAC AAGAAGTTAA GTGACAACGG 
TTTACATGTT GCTTAGCTTC TTTTATTATG CGTAATGATG TAAAAAGACG AATATTCATT 
TGTTTGTAAA AGTGGCATTT CTATGTCTTA AAAGTGACGA AACTTCAAAT GTGCCAAGTG 
TTGAATCACA TCAAAATCAT TTTTATTTAA CGAACATTAT GGATTTCTTA ATTTACTTAA 
CGATGATTCA AATATAGTTA AACAAGGTTT AATGTGAATG GAGCAATACG CCATCTATAA 
TAAAGCTGTA TGATTCAATG AATGTAATCG AACAAATCTA ATAATTACGA ATGGAGCATA 
CAACTATGAA AATAACAACG ATTGCTAAAA CAAGTTTAGC ACTAGGCCTT TTAACAACAG 
GTGTAATCAC AACGACAACG CAAGCAGCAA ACGCGACAAC ACTATCTTCC ACTAAAGTGG 
AAGCACCACA ATCAACACCG CCCTCAACTA AAATAGAAGC ACCGCAATCA AAACCAAACG 
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2700 

2760 

2820 

2880 
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3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 
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3660 
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AAGCAAGTTT AGCATTGGGA ATGTTAGCAA CAGGTGTAAT TACaTCGAAT GTACAATCAG 
TACAAGCGAA aGCAGrAGTT AAaCAACAAa GTGAaTCAGA GTTAAAACAC TATTATAATA 
AmCCAATTTT AGAGCGTAAA AATGTGACTG GATTTAAATA TACTGATGAG GGTAAACACT 
ATTTAGAAGT CACAGTAGGG CAACAGCATT CTCGAATCAC TTTACTTGGA TCTGATAAAG 
ATAAATTTAA AGACGGAGAA AACTCAAATA TAGATGTGTT TATCCTTAGA GAAGGTGACA 
GTAGACAAGC AACAAATTAC TCAATTGGTG GCGTTACAAA ATCAAATAGT GTGCAGTATA 
TTGATTATAT CAATACGCCA ATTTTAGAAA TCAAGAAAGA TAATGAAGAT GTACTTAAAG 



AACGTGCGAT TAAACAACAC GGCTTGTATT CAAATGGTCT TAAACAAGGT CAAATTACAA 114C 

TTACAATGAA TGATGGCACA ACACATACAA TCGATTTAAG TCAAAAACTT GAAAAAGAAC 120C 

GTATGGGTGA GTCAATCGAC GGCACTAAGA TTAATAAAAT TCTAGTAGAA ATGAAATAAT 126C 

ACTTTCTAAC AACAAAGCGC TATGTTGAAT AGTGCTTGTT ATGGAAATAT ATGGAAGTTA 132C 

AGCGACGTAC TGTTGCTTAG CTTCTTTTTT TGAGGGGAAA AGTTACAAAA CTCACACAAA 13 80 

CAGTCGCACC ACGCATTATC TTTTGCTTAA ATAGCTTAAT CATATTTTAT GAATAGTTAA 1440 

AAACAGGTTA ATGTGAATAT CCGAATACAG CTCCTATAAT ATGGGTGTAT GATTCAAATT 1500 

ACGTAATAAA ACAATCTAAT TATAATAGAT TGGAGCATAC AACTATGAAA ATGAAAAATA 1560 

TTGCAAAAAT AAGTTTGTTA TTAGGAATAT TAGCAACAGG TGTAAACACT ACAACGGAAA 1620 

AACCAGTTCA TGCCGAAAAG AAACCTATTG TAATAAGTGa AAATAGCAAA AAATTAAAAG 1680 

3S CTTATTATAA TCAACCTAGT ATTGAATATA AAAATGTGAC AGGTTATATC AGTTTCATTC 1740 

AACCAAGTAT TAAATTTATG AATATCATAG ATGGTAATTC TGTTAATAAT ATTGCTTTAA 1800 

TTGGCAAAGA TAAGCAACAT TATCATACGG GTGTACATCG TAATCTTAAT ATATTTTACG 1860 

^° TTPA1GP>.GGA TAAGAGATTT GAAGGTGCAA AGTACTCTAT TGGGGGTATC ACGAGTGCAA 1920 

ACGrTaAAGC TGTCGACCTA ATAGCAGAAG CAAGAGTTAT TAAAGAAGAT CATACTGGTG 1980 

AATATGATTA TGACTTTTTC CCATTTAAAA TAGATAAAGA AGCGATGTCA TTGAAAGAGA 2040 

4S 

TTGATTTTAA ATTAAGfiAAA TACCTTATTG ATAATTATGG TCTTTACGGT GAAATGAGTA 2100 
CAGGAAAAAT TACAGTCAAA AAGAAATACT ATGGAAAGTA TACATTTGAA TTGGATAAAA 
AGTTACAAGA AGACCGTATG TCCGATGTTA TCAATGTCAC AGATATTGAT AGAATTGAAA 
TCAAAGTTAT AAAAGCATAA CACATATACT TGATGACGAA ATAAGTTGAA ATTGAAATAG 
AGAGGTTAAG TGACGATCAA ACGTTGCTTA ACTTCTTTTT AATGCTTAAA AATTATTTCA 
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TACCTTTAAT ATTTTCTAGT 
GACAATTCGG ACAGTATAAA 
^ ATTATGTTGT TTTGTGTTTG 

TAAATAATAG TCX3ATATTTC 
GAAATAGGAG GGAGACATTT 

10 

TCCTCGCATT TAATTGAACT 
CCCAAAGGTT TACATAATTC 
CATTGGCAAC AATATCAATT 

IS 

ATTGGTATTT GTGGATATGA 

TATAAATTAC ATCCAAGTTT 

20 ACAAATTTTG TGAATTATAT 

AACCAACGTT CAATAAATTT 

GACATTATCA ATCAAGGAGA 

AAATAAAATT AAAA 

(2) INFORMATION FOR SEQ ID NO: 544: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 5280 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 
TCAACATTTC TAACACCAAT GTGAAAATGA TCTATGTGAT TTGCAATGGC TTGATTTGTA 60 
ATATGTGTGC CTAAATGACC TGTAGCACCT GTTAACATAA TATTCATTCA CTTCATCTCC 120 
TAATCTTTAT ATACATAACA TAATACTTAT TTGATGGTTT TCAAAACATT TGATTTTATA 180 
AAAAATTCTA ATCTGTATTT ATTGTCGACG TGTATAGTAA ATACGTAAAT ATTATTAATG 24 0 

45 

TTGAAAATGC CGTAATGACG CGTTTTAGTT GATGTGTATC ACTAATATCA TTGAAAATTT 30 0 

TAATCaGGTA CTACGACAAT ATGATGTCTG TTTTGTGTCT GAAAGTTTTA CAGTTTTTAA 3 60 

so AATAAAAATG GTATAAAGTG TGATTTGTAT PiAPiMiAGAGT CTCGACGGAT AAGAATTGAT 4 20 

TAATAACAGT TAGCATTTTA TTAATTACCT TAACAATGAT TCAAGTTTAG TTAT^TGAGG 480 
TTTAATTTGA AAGGGGATAG CGCCTCAATA TAATGTAGGT AGATTGTTCA TATTACGTAA 54 0 

55 



ATCGAAAGAA TTGACACTAT TAGTACTGCT ATGGAGTTAA 114 0 

AGGAGAACCT GGTACGTCAA AAAACAATTA AAAAAAGATG 1200 

ACGTTAATAC TTCTGATGTT AGTAGTTACA TTATTCTTTT 1260 

AACCCGTGGC ATTAGTATTC ATATAAATAG TCTTTAAATA 1320 

AATGATAAAT ACTGAAAGAT TAAATTTAAT GATTCCAAGT 1380 

TTATAATATT TGTAGTCATC CACAAGCAAA TATATACACT 144 0 

CAAATTAGAC ACACAACGGT GGATTGAAAA ATGGCGAAAC 1500 

TGGTTACTTT GTATTGGTAA AAAAAATAGA TTGTAGTGTT 1560 

ATATCGACAA TTAAAGCAAG AaACAGTATT AAATTTATTT 1620 

TGAAGGACAA GGGTACGCAT GTGAGGCTAT TACAGCAATC 1680 

CGATCAAGAA ACAGTAAAAG TTATCAGGAC "AAATAAGTGT 174 0 

AGCAGAAAGG CTTAAATTCA AGCGAGACGA TACTATGGAC 1800 

TATTGTGTTT TAAnAATAAA ATACTATGAC ATTATCTAT^ 1860 
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GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 
ATGtCCTCGG GTTTAATTGA ATAGAAATCG GACTTATCAA CGAAATATGT TTAAGTCGAA 
CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 
ATGCATAATT nCTTCAGTCG GTCAGCCTGT CGTTGCATAG ITCCTATGCA GCAAATGCAT 
ATCCTAATCC TTTAACATTG GCATTnCTGC AAATCAACGC ATAGAATCCA TTCACTGTTA 



(2) INFORMATION FOR SEQ ID NO: 54 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2160 
2220 
2280 
2340 



*-*v-*wx- .r«>rt A M 1 1 ^jiJA I-ICACTCTTA 24 00 

ACTTTTTnCA ACAAATGTCT nACATG 



2426 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 6C 

GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 12C 

GAAATAATCA TATTGGATGA ACCGACTGCT GGTCAAGATT TCTATCATTA TAATGAGATA 18C 

ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 24 0 

ATGCATTTAT TGTCTGAGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 3 00 

GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA TCTGTGAGAT TGCATCATTG 360 

AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 420 

GTACAATTAT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 480 

TAGGTTTTCA CCCGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTGTT AAACTTTTAT 540 

TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 600 

rTAGTGCTTC ATCTATTTTA TTGGTCAAAT ATGCTCATAT TGAATGGAAA CAAGTTCGCT 660 

TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 720 
ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 
GATTTACGCT AACATCACAG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 
GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 
GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 
ATATTCCAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 
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GTATPiMiPiAG TCAACGAAAT ATTATTTTGG AAGAACTTGC ACGTACTGAT GATAAAAAGT 360 

ATGCTACACA AAGCATTTTA GAAAGTATAT TTAATAAAGA CX5AGGCAGTT AAAATTCTAA 4 20 

AAGATATACG TGTTGATGGT AAAACAGATC AACAAATTGC AGATCAAATT ACTCGTCATA 4 80 

TTGATCAATT ATCTCTGACA ACGAGTGATG ATTTATTAAC GTCATTGATT GATCAATCAC 540 

AAGATAAGTC GCTATTGATT TCTCAAATTT TACAAAOGAA ATTAGGAAAA GCTGAAGCAG 600 

ATAAATTGGC TAAAGATTGG ACGAATAAAG GATTATCAAA TCGCCAAATC GTTGACCAAT 660 

TGAAGAAACA TTTTGCATCA ACTGGCGACA CGTCTTCAGA TGATATATTA AAAGCAATTT 720 

TGAATAATGC CAAAGATAAA AAACAAGCAA TTGAAACGAT TTTAGCAACA CX5TATAGAAA 780 

GACAAAAGGC AAAATTACTG GCAGATTTAA TTACTAAAAT AGrAACAGAT CAAAATAAAA 84 0 

TTTTTAATTT AGTTAAATCG GCATTGAATG GTAAAGCGGA TGATTTATTG AATTTACAAA 900 

AGAGACTCAA TCAAACGAAA AAAGATATAG ATTATATTTT ATCACCAATA GTAAATCGTC 960 

CAAGTTTACT AGATCGATTG AATAAAAATG GGAAAACGAC AGATTTAAAT AAGTTAGCAA 1020 

ATTTAATGAA TCAAGGATCA GATTTATTAG ACAGTATTCC AGATATACCC ACACCAAAGC 1080 

CAGAAAaCGt TAACACTTGG TAAAGGTAAT GGATTGTTAA GTGGATTATT AAATGCTGAT 1140 

GGTAATGTAT CTTTGCCTAA AGCGGGGGAA ACGATAAAAG AACATTGGTT GCCGATATCT 1200 

GTAATTGTTG GTGCAATGGG TGTACTAATG ATTTGGTTAT CACGACGCAA TAAGTTGAAA 1260 

AATAAAGCAT AATTATATTG GGGGAAGAGC ATCTATATAT TTTTTTAAGT ATATAAGACG 1320 

TCTTATTTCC CCTTAATTTA TTGTGAAGTA TATGCAAAAT GCAATGAATA GATTGTCCAT 1380 

CATTTTAACG TTATAATGAA TTTAACGACT TAGAACTACA CAAGTAAAGG AGAATGAAGA 144 0 

TCTCTCGAAA AACGGCGCTA TTAGTTTTGG ATATGCAAGA AGGTATAGCXS AGTAGTGTAC 1500 

CTAGAATAAA AAATATTATT AAAGCGAATC AGAGAGCAAT TGAAGCAGCA AGACAACATC 1560 

GAATACCAGT CATTTTCATA CGTTTAGTGT TAGATAAGCA TTTTAATGAT GTCTCCTCGA 1620 

GTAATAAAGT GTTTTCAACA ATTAAAGCTC AAGGATATGC GATTACTGAA GCAGATGCAT 1680 

CTACACGAAT ACTTGAAGAT TTAGCACCAC TAGAAGATGA GCCGATTATT TCTAAGCGAC 174 0 

GCTTTAGCGC ATTTACAGGT AGTTACTTGG AAGTTTATTT ACGTGCAAAT GATATTAATC 1800 

ATTTAGTATT AACGGGTGTC TCTACAAGTG GAGCTGTATT GAGCACGGCA TTAGAAAGTG 1860 

TAGATAAAGA CTATTATATT ACTGTTTTAG AAGATGCTGT TGGTGATAGA TCAGATGATA 1920 

AACATGACTT TATTATTGAA CAAATTTTAT CACGCTCATG TGACATTGAA TCCGTAGAGT 1980 

CATGGAAAAG TAGTTTATAG TTAATATAAC GTCAATTAAA GCTCGGCAGT AATGTTTGAG 204 0 
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40 



4S 



SO 
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CX5ACTTCGTA AGTGTGCTCA ATCTCGCCTG CATATGTCAC AGTAAGAGTA TCTTTCTX3TG 
TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA rrGTTTAGCT 
TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AGCyTCATAA CCGAATTCAG 
TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 



1200 
1260 
1320 
1380 

CTTCTCTGCC ATCAGAACGC ACAATTGT 
10 1408 



(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 
GTTCGTCAAT GATTTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT GCTCTCTTAA 
AAGCAGTTAT TGACTGAAAA TCTACTTCTA AGAGAGCACT TTATTTAAIT ACTTAAGAAA 
TCTTGAAATT TCAATATACG ATGTTTATGA TAAGTCGCTT ATTTCATCTr TAGGCTTGTT 
ATTAGTAAGT AGTTTAATAC CACTGATTAA CCATAAAGCA AATGTAATTA TGTTACCACT 
TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG ATGCTTTAAA 
CATACCGATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG ATAAGAATAA 
TACAGTGAAA ATTGCTGCTG CTGTTTCTGc ATCAACTGGG nCAACCTCAC CATTAACTGT 



(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 
ATAATCATGA AGTnGCTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 
AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACArG 
GAAGCGATAA AATTAATCAT GCTATGATTG AAAAATTAGC TAAAAGTAAT GCCTCAACGC 
AACATTACAC ATATCATAAA CTGAATACGT TACAATCTTT AGATCAACGT ATTGCAAATA 
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AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 300 

CAATAAATCG TGGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 360 

CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAaGAGGAA AaTGTAaGaT 420 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 4 80 

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CGGTA 525 
(2) INFORMATION FOR SEQ ID NO: 540: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1408 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



30 



35 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540: 

TTGATTTGCT ACAAAGTATC TnCTCATTTC TGTATCCTGA AAAATCTTTA GTGTAATAAT 60 

GTTGTTCAGT TTTAATATTT TCAGTCATAG TTGACTACCT CCGTATATTT TGATTTAATT 120 

AAGTTCTATA TTTTGATGAA CACTTATTGT TACTTGTTGG CGCAAGTAGC AGTTTTTTCA 180 

TTCTTCATAA AAGTATTCCT TATAGAATAT GAATGTTGCG ATACTTGCGA ATCCTGCAAT 240 

TGaCCATGCT GtAGTGAAGT ATAGAAACGG CATAAGTACA ATCGCTAAGA CTGTGAAGCA 300 

TAGTACTGCT ACTAGGTAGC TTTTATAAAT GTTACTCATT TTCTTTTTTC AACTCCTCCA 3 60 

TTATTCTCTG GTCTGATAAG TCGTGATAAG GGAATTTTTT CCtAGCTAAT TGGACTGGTA 420 

TTCTGCCTCG TATCGCAATG TATCCTTCAT CTTCAAGCTC TTTATTCAGT TCTCTTATTA 4 80 

TriGTCCTGC TTTCGATTTA GAAACAGATA AAATTACCgC AAGTTCTTTA GCTTGCAAAC 540 

TATTTTTCAT CATATCTTTT CCTCCTTTAA AATAACTGTT GATTCTCTGG GTTATCTGCT 600 

TCGTAATTAT CTGCAATAAT ACTTTTAGCG AAAAAGTCCA AACTGACCTT ATATAGGTTG 660 

TTCATAGATT TCriTACGTT AACCCCTTCC TCAAGTACAT AAGGCACCCT AAAATCATTT 720 

ATAAACAGTC CGTTTTCGTC TAAAGTAACG GTTGGTAATT CAGGTTTGTT CCGTCTATAA 7 80 

45 ACTTCTCCTA GTGTAGGTTT TTGCTTTTCA GCTTGTTTAG TGAAGTCGGA AAATGCCTTA 840 
AGTAGTTTTA TTCCTGAATC AGGATCACTG TGTCGCTCAA TCGTTTCTGC TGTAGACTCT 900 
TTACTAAAAT CATTTCTATT GATTACAGGC TTTCTCGTAT TTCGTTCAAT CTTCCAAACC 960 

TTCCACGTCA CAACTGCCAT TGTGGTGAGG AGGGTTGTTT TGTATAGTGC GTTCATTTGT 1020 

AATTCCTCCT ATTAAGTTGT TTGTTCAATT GTGTGTGTTA TTCTTCTTCG TCTAAATCAA 1080 
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GAACCAACTT CAGnTTGGCA GCCACCTGTT GCACCAGCTA CACTTCCATT GTrrcCTACG 
ACACGCCCAA ACAATGCTGA AGTGAATAAG AAATCAATCA TTTCCrrCTTC TCTTAAATCA 

TGTGTTrrrr ctaatttaaa aagtgcaccg ggaatggtac ccgaggaacc agctgttggc 

GTtGCACAAA TAATACCCAT CGCAGCATTG ACTTCATTTG TTGCAATGGC AcCtTTGcTG 
CGTCAATCAT TTCATATCCA GACAAAGCAT GATCTGTTTC ATTATAATCA CGTAGTTTAG 
CAGCATCATG ACCAGTGTAG CCCGTTACAC TTTCAACCCC ATCACCTGTC GTCCCTTTGA 
TTACTGCGTC TCGCATGACA TCTAAATTTT GTTTCATTTG CGCTCGCACT TCATCACGTG 



3000 

ATTTACCGCT TAATTCCATT TCTTCTTTAA CCATGATATC CGCAAATGAC ATArTATTTT 3060 
CTACGGCATA ATCTATAGTC TCTCTAATTG AATCAAACAT GTTTATTCCC CCTCTAATTT 
ATATAGGAAA CGTTTACGTC ACTGTAmc TCTTTAATTC TATTTAATGT TGATTCTGAG 
20 ATTGCTTTAT TTAATGGTAT TACAACCAAG CArTTATCTT CATCTATCTT AATAAATTCA 

TCTTTACAGT CTAATTTCAT ATCGTTGATA TCATTCATGA AATGATTTAC TTCTGCrTTA 
GTCATATTTC CGTCAACAAC TAAAATTGGT AATCCATGAT TTAAATCTAC TTCTAGTCCA 
TTTATATGAA TACCTTTAAT TTTAATTGTA CCACCACCGA TTGAAATACC GATAATTTCA 
ATGTAGCGAC CATCATTACG AGATGATTTG ATATAAGCAC AGTTTGGATG •TTGACCAATA 
CTATCGCCTT CTTCTTCGAT GATATCTATT TTAATACCAT CATCAGCTGC AATTTCTAAT 
GAAGATTTAA TTCGGTTATC AAATGTTGAA TATCCCATTC CTCCACCCAC AATAGCGACA 
TCTGTACCAT GTCCTTGGTG TGTTTGAGCA AATGATTCAT AATAATCTAT TTCAATATTT 



TTATATCTCC CAATATTGCG CGTGCTGAAT TCCCCTITAC TGCACCAGCC GTATGAGAAC 



(2) INFORMATION FOR SEQ ID NO: 53 9; 

40 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 
TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 
GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 
TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 ■ 
3600 
3660 V 



35 w.v.™^ i^j(^t_L:AU(JC GTATGAGAAC 3720 - 

TTGAAGGGCC CAT 

3733 
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AGTATACCAT TAATTTCAAA ATGACTCATA GCAATTCATT TTATACTATA AAATTTACAT 84 0 

GTATACTTTT ACGTTAGATT TCATTACACA TATTTGCATT CAAATAACGA AACGCTTTTA 900 
ATTJ^TTACTA AGGGGGAATT GATATGATTA GATACGCTAA AAAAGAGGAT TTAAACGCTA 960 

TATTAGCGAT ATACAATGAT GCCATTATCA ATACTACAGC TGTTTATACT TATGAACCAC 1020 

AAACCATAGA CGAACGTGTC GCATGGTTTG AAACGAAACA ACGTAAGCAT GAGCCTATCT 1080 

TTGTATTTGA GGAAAATGGA AGTGTCTTAG GGTTTGCGAC GTTCGGTTCA TTTAGACCTT 114 0 

GGCCAGCATA CCTATATACA ATCGAACATT CTATTTATGT CGATGCTTCA GCTAGAGGAA 1200 

AAGGTATTGC TAGTCAATTA CTACACCATT TAATTGTGGA AGCAAAAGCT AAAGGTTATC 1260 

GTGCGCTAGT TGCAGGCATT GATGCTTCCA ACAAAGCGAG TATTCAGTTG CATCAAAAGT 132 0 

TTGCTTTTAA GCATGCCGGC ACACTGACCA ATGTAGGTTT TAAATTTAAT AGATGGTTAG 13 80 

ATTTAGCATT TTACGAATTA GATTTACAAG ACTAGTAATG TTTGAATCAC ATAATATAAA 144 0 

CAAGACAACC ATGTTAATTC CCTTAACATA ACAAGCCAAC ATATAAAATT TTAAACTTCT 1500 

CAGGGGAGTG GGACAGAAAT GATAAAGAGC CACTAATGAT TTATTATGTA GTGGTTCTTA 1560 

25 CACATTAGCC ACAGCTAATG TGTAcTTAAA AATAGGAATA CATGAGTAAA ACTCATGCAT 1620 

AAGAAATACT AATTTCTATA GAAAAAGTAT TTCTTTATCG TCGTCCCACC CCAACTCGCA 168 0 

CATTATTGTA AGCTGACTTT TCGTCAgCTT cTGTGTTGGG GCCCAAAAAG CTTGTTACAA 174 0 

GCGCATTTTC GTTCAGTCAA CTACTGCCAA TATAACTTTG TAGAGCATAT TACATTGATT 1800 
TACATTGTCC CTTTTATTTA TTCTTTTCAA ATACTATCCC CATAGCTTTG ATTTAACGCT 1860 
TTTTCTCAAT AACAAAACGA ATATAGTAGA ACATGAAAAC GATAATCATG CTGAGCGATA 192 0 

AAGATTTAAA TAATAGATTG ACCCACGTTC CCTCAGTCGT ATATCCATAT GTAATCGTTG 1980 
TGTTAATGAT GAATGCTATA AAGATGATTG ATAGTCTTAG CATATCATCA CTCCTTTTAA 2040 
GTTATTTTAG ATATACGGGG GCGCTTTTGC AATCACTATT TTGATTAGTA TGCATTTTCC 2100 
ATAAATCTTT CAACTTCTTC AGAGATAATT AAGAAGCATC TATCTGGTAC TAATGATCCA 2160 
GACAGATGCT TCTTTTTTAT CAATATTTTA TTGTTATCTC ATTAATTATT TTTAACCATA 2220 
4S TCTTCAGCTG TGCCAAAGAT TTTACGTTTA ATTGCTTCGC CAGTTGGTGT GCCTGCTAGT 2280 

CCACCCAATC CAGTTTCACG TAATGATGCA GGAAGGTTAC GACCAACCTT ATCCATTGCT 234 0 

TCAATAACTT CATCAACAGG GATTCTACTT TCAATACCTG CTAATGCTAA ATCTGCTGAA 24 0 0 

ATTAAAGCGT TACCCGAACC AATTGCATTT CTCATAACAC AAGGAATTTC AACAAGTCCG 2460 
GCTACTGGAT CACAAACTAA ACCTAATAAA TTACTTATCG CTAATGCCAT AGCGTGCCCG 2520 
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CATGTAATGT TGATGGTTTT 


ATCGTTGCTG 


TGCCTTCGTC AACTTCAATA 


TTTGCACCCA 


240 


5 


TGCGCTTTAA TTCTTCAACA 


TGTTTAAAAC 


GCTCCGGATA AATCGTATCA 


GTTACAAATG 


300 




AAGGACCATT TGCCATAAAT 


AATAATGGTG 


TAATAGGCTG TTGCAAATCA 


GTAGCAAAAC 


360 




CTGGATATAC TAGTGTTTTA 


ATATCAACAA 


ATTGATATGG CGCATTATTA 


TTGATGCGAA 


420 


10 


TTCTTTCGTC TCTTACATCA 


ACATTCACAC 


CTAATTCACT AAATTTAGCA 


GTTAATGTTT 


480 




CTACATGTTT CGGAACAATA 


TTATTTAATA 


TAACATTTTC TCCACATGCT 


GCAGCGATAC 


540 




ACATATATGT GCCTGCTTCA 


ATTCTATCAG 


GTATAACTTG ATACTCAGAA 


CCATGTAATT 


€00 


IS 


CTTTGACGCC ATTGATTTTA ATTGTTGATG 


TACCCGCTCC CTTAATATTA 


GCTCCCATAC 


660 




TTGTTAAGAA GTTAGCAACA 


TCAACTACTT 


CCGGTTCTTT AGCAGCATTT 


TCAATTACAG 


720 




TTTGTCCTGT TGCATAAACT 


GCAGCTAGCA 


TAATGtnAAT TGTTGCACCT 


ACGCnAACCA 


780 


20 


TATC 








784 



(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 53 8: 





CAATCTCCAC CAATGACACG 


TCGTCATTCG 


TGACCTCATA 


CCArACAAAA 


AACAGTCTCG 


60 


3S 


CAATCAAGAC TGTTTTCCAC 


TCAATATATT 


CATCCATTAG 


CGTAATAGAT 


TATTTGACTT 


120 




CTGTAGCTAC AAAGaTTTTA 


CGTTTTTCCC AAACGCCTGT 


cTTTTCATTG 


TAATCATCAC 


180 




AAGTAATTAA TGTTAATTGT 


TTATCTTTAC 


CTTTTTGTTC 


ATCTAGAAcT 


CCTACATCTG 


240 


40 


TAGGCTTAAC ATCTCTTATA 


CTTGTCATTT 


TATACTTACG 


TGTTTCATTA 


CCAACTTTAA 


300 




AGTACACCAT ACTACCTTTT 


TTGGCTGCTT 


TAAGATTTGT AAATTGATAG 


TTCGGACGGT 


360 




CAATGAAAGT GTGTCCTGCA 


ATTGAAATAT 


TTTGATCATC 


TAGTGATTCA 


TTTTCTTCTG 


420 


45 


CAAAGCTTAC ACCTCTATTT 


AATTGTTCAG 


GTGTTGCTGG 


TCCTGGATAT 


ACTGGTTCTT 


480 




TAATATCAGC ATCTGGAATT 


TCAATATAGC 


CTGCCACTTT 


CGATTTATCT 


TTCGGAATTT 


540 


SO 


GAGGTTTAGC TTGCTGCTTT 


TTATCTTTAC 


TCGCCTGTTC 


TTTTACATTT 


TTATCATATT 


600 


GTTCAATCTT TTCATCTTTA 


TCTTTATCGT 


GAAGATAATT ATCGATATGT 


GGTTTAGCAA 


660 




ACAAATATGC TGCCACTAGG 


aTAAGTACCA 


CACCAGCGAT 


TGTCATTAAT 


CGATTTGTCC 


720 
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AATCTATTAA AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 240 

TTATGCTAAT CCAGCGCGTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 300 

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 360 

TAATTCACGG AACGGTGTIT TCGTTTCCCA AGATATCATC GCTTTTGGTT GTACTTTGTC 420 

GTATGCTTCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 480 

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 54 0 

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 600 

ATAATACGCT CAGCAGAAGA ATGAGAAATA TsTCTTTCAT GCCATAATGG CACATTCTCA 660 

TAAGCTGTAG TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 720 

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCTTTTT GGCCTTTTGC AAATGCTTCT 780 
TCAACTTCTC TCGTTTCGGT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 84 0 

GATGTCGCGA TTAATGCTAA TGTCGCAATA TAGTATGCAT GTCGATCGCG TTGCAATGTT 900 
TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 960 

GGAGGAATCT TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 1020 

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCATTTTG 1080 

ACACCAAATG TAGTTGGTTC TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 1140 

TTATAATITT TTGCTTTTTC AGCTAAAACG TCGATAAATC TTTCTAAATC TTTTTCAATA 1200 

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 1260 

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTTGT CTAGTAAAGG 1320 

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 1380 

CGTTrrcACG AATrrrrrGT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 1440 
^ (2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOLOGY : 1 inear 



35 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 
GATAAATCTA TnCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG TGTTCTACAA 
TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTAAAT TGTAGTTACA CCCTCAGCAA 



60 
120 



55 
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(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


535: 






10 


AACAAAGCCT 


TCCAATTATC TGCGTCGGTA 


GAACAAGTAT 


TAGCAACTTT 


ATCACCTACG 


60 




CTAAACAGTC 


CTTACGATTT ATACGGCACG 


ACAAAAATGC 


TAGATATTAC 


ATTCGATTCA 


120 


IS 


TTTGAACATG 


ATGGTACAAC GTACCCTGTC 


GACTATGCTA 


CGTTTGAAAA 


TGATTATGAA 


180 


GATAATAAAG 


ATCCTGAGTT TAGACGTAAA 


AGTTTCAAAT 


CGTTTAGCGA 


TGGGATTCGA 


240 




AAATATCAGC 


ATACTACCGC GGCTACATAT 


AATATGCAAG 


TACAACAAGA AAAAATTGAA 


300 


20 


GCTGATTTAC 


GTGGATTTGA ATCAGTCATC 


GATTATTTAT 


TACATAGTCA 


AGAAGTAACG 


360 




CX3TGATATGT 


— * »* » * wv^«-k X /-I X VJ 


ATTATG CGTG 


ACTTGGCACC 


AGTTATGCAG 


420 




AAATATGCTA 


AACTTTTACA ACGTATTCAC 


GGATTAGATA 


ACATGCGTTT 


TGAAGACTTG 


480 


25 


AAGATTTCTG 


TAGACCCTGA TTATGAACCA 


GAGATTTCAA 


TTGAAGACTC 


AAAAAATTAT 


540 




ATTTTCGGTG 


CGTTAAGTGT TTTAGGTGAT 


GACTATACAA 


ACATGTTACG 


TGAAGCATAC 


600 




GATCAGCGAT 


GGaTTGATTT TGCACAAAAT 


AAAGGTAAAG 


ATACAGGCGC 


ATTTTGTGCA 


w V V 


30 


AGTCCATACT 


TTACACATTC ATATGTGTTT 


ATTTCTTGGA 


CTGGTAAAAT 


GGCTGAAGCA 


720 




TTTGTCTTAG 


CACATGAATT AGGTCATGCA 


GGTCATTTTA 


CATTAGCTCA 


AAAACATCAA 


780 




CCATATCTTG 


AATCAGAAGC ATCAATGTAC 


TTTGTTGAAG 


CCCCTTCTAC 


AATGAATGAA 


840 


35 


ATGTTGATGG 


CCAATTATTT ATTTAACACA 


AGTGATAATC 


CAAGATTTAA 


GCGTtGGGTT 


900 




ATTGGCTCAA 


TTTTATCTAG AACATATTAT 


CATAATATGG 


tACCCmTTTA 


TTAGAAGCnG 


960 


40 


CTTATCCACG 


GGGAGTG 








977 


(2) INFORMATION FOR SEQ ID NO: 536: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



50 (^i) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 

AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAAITCGT AACATFCTTC 60 
TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATTCATCT AACGCTGCAC CTCGTAATTT 120 
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TTGAATAAAT TTTATTCTTC AGTTTGTTGG TCTTTCTTAG TGAATCTTCT AATTAAGAAT 444 0 

GCCATACCTG CACCTAGAGC TAATTCAGCA TATGGTAAAT CGTCATTATG TGACATACCA 4 500 

GTATCTGGTA AAGTTTTAGC TTGTTGTTTA GCTTTATTAA CTTTTCCTTG TTGAGCTGAT 4560 

TTTGTCTTAG CTTGGTGGTC GTCAGTGTTA GTTACATTAA GCATATCTTG ATTAGCACTA 4 620 

TTGCTTCCAT TTGAAACTGT AGCTGGAGAT GCATTGGCAC CGTCX3TTTTG CGTAgyTTTA 4680 

TTGTTTGCAG CTGAACCAAC TGATTTTTGC GTATCATTAG TATCTGCTGT TGCCGTATCA 4740 

TCTTTTTGGC TAACATTAGT TGAAGTCATT TTTTCTTTTG CTTCAGAAGA TGCAGATGTT 4800 

GATGGTTTAT TCGAAACTTC AGTATCAGCT TTGCTTGGCG ATTTATCTGC TTCGTTAGAT 4 860 

GCAACGTTAG TTTCAGACTT AAGTTGTCCT GCATCAGTTT GATTTGTCGT ACTTTCTTCT 4 920 

TTATCTTTTG ATGTATTAGA AGGTACATTT GGTTCTGTTA TGTCTGCTGA AGGCAATGTT 4 980 

TCAGTTGTtG ATTCAACCAT ACTTTGATTT GTTGAATCAC TACCATCTTT TTCTGCCTTA 504 0 

GCTTTATTTT CAGATTTTGG TTGTGCAACC TTGTCATTAG TTGATTGAGA TTCAGCACTA 5100 

TTATTTACTT CAGCATTTTG TT TT GAATCA TTTACAGATG CATTATCTTT GCTATCAGCA 5160 

2^ GATGATGCTG CTTCTGTGCT CGCAGTTGTT GGAGCCGTTG CTGTTGATCC TGTTGGTGCA 5220 

TTCTCGTTTG TTGCTGTAGT TGTACTATTG TTATTTGTTG TGCTTTCTGC TGGCGTTGCA 5280 

TTATCAGTTT CTGTTACAGG TTTATCAGTT GTGCCGTTAT TAGTTGATTC TACTTCTGGT 5340 

TTACTAGTTA CATCGTTATC CATTGTCGGA CTGTTTGTTG ATGCATCTAC ACTAGAATTG 54 00 
TTATTAGCTT GCGGTTTATC ATTTGCATCA TCAGTTGCTG ATGTTGCTGT TGTTTCACCT 54 60 

GTTGCCGCAT CACTATTATT TGGTGTTGTC GGAGAAGCGT CTGCTTTGCC ATTAGCTGTC 5520 
GTCTCAGATA CGTTAGGTTG TCCAGTATTT TCTGGTGTTG CATTAGCATT TGAATTTGCT 5580 
GTTGCATCAT TATTATCTAT ACCATTATTA GTATCATTAG CATCTGGATC ATTCTGAGGC 564 0 

ACAATCGCTT CAATTGCAGG TATCGTTACA TTTTGTAATT CAGCAACTTC TGCATTTGTT 5700 
TGTGTTTTAT CTAATTTATC AGCAAATCTG TCAAAATATC TACCTAAATC CGTACGTGCA 5760 
ATTTCTTTCG CCGATGCATC TGCATCTGCA TTTTTAATTA TTTCTATTTG CTTGTTAACC 5820 
ACTTCTCTGA TTGCTTCC7VA AGCATTTTTC TTAACTTCAG GATTAATACG TTGTGCTTTA 5880 
AGTTGTTCAA GCGCACTATT TTTGACAGTA GCGATTTCTG CATTTGTAGT TTGATCAGAA 5940 
ATATCTTCAG TTGCTTTTGA TAAAATGTCT TCTAAAGCAT TCGTAAACGC TTCTTTTTCT 6000 
TCAGTTGTAG CATCAGCGTT GACATTTACA CCTGCTTCAA TCTGGTCTAG TGCAGTTTCT 606 0 

(2) INFORMATION FOR SEQ ID NO: 535: 
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TAACAGCCAA AATGAAGCAA TTGAAAAAGC AGGTAAAGCC TTAGTTGATA GTGGTGCTCT 
AACAGATGCT TATATTCAAG CAATGAAAGA TCGTGAGCAA GTCGTATCAA CATTTATGGG 
AAATGGCTTA GCAATTCCTC ATGGCACAGA TGAAGCTAAA ACAAATGTGA TTCACTCAGG 
TTTAACATTA TTACAAATCC CTGAAGGCGT TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 
CX3TGGGAATT GCTGGTAAAG ATGGOGAACA TTTAGACTTG TTATCTAAAA TTGCAATTAC 
ATTTAGCGAA GAAGAAAATG TGGATCGTAT CGTTCAAGCA AAATCTGCAG AAGAAATTAA 
ACAAGTATTC GAGGAGGCAG ATGCATAATG AAAGCAGTTC ACTTTGGTGC TGGTAACATA 
,5 GGTCGTGGTT TCATTGGTTA TATTCTgCAG ACAACAATCT TAAAGTAACA TTTGCAGACG 
TCAATGAAGA AATCATTAAT GCTTTAGCTC ATGATCATCA ATACGATGTT ATTTTAGCTG 
ATGAGTCTAA AACAACGACG CGCGTGAAtA ATGtTGATGC AATTAATTCA ATGCAACCTT 
CTGAAGCGTT GAAACAAGCA ATTCTAGAAG CTGATATTAT TACAACAGCT GTTGGTGTTA 
ACATACTACC TATTATTGCT AAATCTTTTG CGCCTTTCTT AAAAGAAAAA ACAAACCATG 
TTAATATTGT TGCTTGTGAG AATGCTATTA TGGCAACTGA TACATTGAAA AAAGCAGTAC 
TTGATATTAC TGGCCCTCTT GGTAACnaTA TTCATTTTGC TAACTCAGCA GTTGATAGAA 
TTGTACCATT ACAAAAGAAT GAAAATATAT TAGACGTTAT GGTTGAGCCA TTTTACGAAT 
GGGTTGTTGA AAAAGATGCA TGGTATGGTC CAGAACTAAA CCATATTAAA TATGTTGATG 
ATTTAACACC ATATATTGAG CGTAAATTAT TAACTGTGAA TACAGGACAT GCATATTTAG 
CGTATgCTGG tAAATTTGCA GGTAAAGCTA CAGTTTTAGA TGCAGTTGAA GATAGTTCAA 
3^ TTGAAGCTGG CTTACGCCGT GTTTTAGCTG AAACTAGTCA ATATATTACT AATGAATTTG 
ATTTTACTGA AGCGGAACAA GCTGGTTATG TTGAAAAAAT AATAGATCXST TTCAACAATT 
CTTATTTATC TGATGAAGTA ACACGTGTCG GACGAGGTAC ATTACGTAAA ATTGGCCCTA 
AAGATAGAAT TATAAAACCA TTAACATATC TTTATAATAA AGATTTAGAA CGCACTCGTT 
TATTAAATAC AGCTGCATTG TTATTGAAGT ATGATGATAC AGCAGACCAA GAAACTGTTG 
AGAAAAATAA TTACATTAAA GAACACGGTT TAAAAGCX3TT TTTAAGTGAA TATGCTAAAG 
TTGACGATGG CTTAGCCGAT GAAATAATTG AAGCGTACAA TTCACTTTCA TAATTTATTG 
AGCTTTGTTT GAAACAAGAA GTTTCCAACG TTATTCGTTA ACAATCAGTA ATAATGTAGT 
AGTTCCCTTG AATTAACAAT ATTAAATTTC TGAACATAAA AAATACTCCC TTCAACATAG 
ACACTTAACT TGTGTTATGT ATGAAAGGAG TATTTTTGCG TTAATAATTT GTTTTATTTT 
CGAGCCACAG CCACCTATTC AATGGCTATT GGTCATTACT AAAACAAATT CATATTAACT 
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TAGAGCTTGA TTTAAATAAG TACCAACTAT CTTTATCTCG AAAGCXmSGC GAAGGCATTT 840 

ACTTGGTAGG TACTGAATCA AAGAAACGTG AATTTTTAAG TCAATTAATG GTGAATAACT 900 

5 TAAATAGTAC TAGCGTTTAT TCAGTAATTG AAAATCATTT TGTCTTTCAT TCATTAAATC 960 

AAATCCACAA AGACTTTGTT GACTTAGAGC GCATTTTTAA TGTTGAAAGA CtATTaATGG 1020 

ACTACCTAAG TGCCTTACCC TACCAACTTA CCGAATCAAG TTATTTAACT TTAACTGTCC 1080 

ATATCGTGCT CTCCATTTCA CGTATAAAAA ATGGAGAGCA TGTCGCATTA AACGATGATA 1140 

TTTATGATTC TGTACAAAAC ACATTTGAAC ACTUVAGTaGc AAGCGAACTT GCTGATAAAC 1200 

•rrGGTCAAAT ATATGACGTC ACGTTTAATC AGGCAGAAAT TGCTTTCATT ACTATCCATT 1260 

TACGTGGAGC TAAACGAAAA AATCTTAATG ATACATCATT AAATAATCGT TGTGAAGAAA 1320 

ACAAAATTAA AGCGTTTGTT AACAAAGTAG AAATGATTTC CGGTATGACA TTTGCAGATT 1380 

TGGATACTTT AGTAGATGGA CTGACGCTAC ACCTTAATCC TGCAATCAAT CGTTTGCAAG 144 0 

CTAATATCGA GACCTATAAT CCGTTAACAG ACATGATTAA GTTCAAATAT CCAAGACTAT 1500 

TTGAAAATGT AAGATTAGCT TTAAATGATT GTTGGCCTGA TTTGATTTTT CCAGAGAGTG 1560 

25 AAATTGCTTT TATAGTtTTA CACTTTGGTG GCTCGATTAA AAACCAAGGT AATCGATTTT 1620 

TAAACATATT AGTCGTTTGC AGCAGTGGTA TGGGAACTAG TCGTCTATTA TCAACTCGTC 1680 

TAGAGCAAGT TTTTAGTGAG ATTGAGCGTA TTACACAAGC ATCAGTCAGC GATTTGAAGT 1740 

^ CACTAGATTT AAGTCAATAT GATGGCATTA TTTCTACTGT GAATTTAGAC ATCGACTCCC 1800 

CCTATTTAAC GGTAAACCCA TTATTACCAG ATAGTGATAT CAGTTATGTC GCACAGTTTT 1860 

TAAATACAAA GTCTACXSTTC CAAGAGACGC ATGATAAATC ATCAAACATG ATTGATAAGG 1920 

ATGATGTTCA TGTTGAAACG AAAGATGTTG ATGGCAACAC ATCTTTTGAA AATGAACAAA 1980 

CTTCATACTT AACTTCAGTT TTCGAAAAAC ATTTAAGTGA CGAAAAATCA GAACAATTAT 2040 

TGCATCATAT GCGTTCGGGT TTAACTTTGC TTGATTCAGT GAAAATAGTT AGTACCGAAG 2100 

TTAAACAGTG GCAAACATAT ATCGCAGATT ATCTATATCA ATGCGATGTA ATAAACGATC 2160 

CAACGTCATT CGCTGAACTA CTAGAGCAAC GATTGATTGA CAATCCAGGA TGGATATTAA 2220 

45 GTCCATATCC TGTTGCAATA CCACACCTAA GAGACAATAT GATTAAACAC CCTATGATTC 2280 

TAATCACAGT TTTAGAAGAA CCGTTAACAT TGCCTAGTAT TCAAAATGAC JAT:QPJ\ACPJ^ 2340 

TTAAATATAT GATTTCCATG TTTATTTCTG ACAATGATTT TATGGCATCA CTGGTAAGTG 24 00 

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG AATCTATTGA TACTTTTATG GAAAATCCAC 2460 

AGGAACTTGA AACATTATTA AGAAACAAAT TTTTAGAACG AATTAAAAAA CAATTTATTT 2520 
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ATTTTCATAC CATCCACCTC CATAATCATC TTAACGCGAA CATTrTCAAA GCGCAATCAA 
AAATCCACAA AATTX3TAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 
ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 
TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTOGTGTTG CTAAACGTAA 
CGATGCGAAG TTAACAATTG TGAATGTAAT TGATTCAAGA ACGTATTCTT CTTATGAAGT 
TTATGATGCT CAATTrACTG AAAAATCTAA GCATTTTGCA GAAGAATTAT TAAATGGTTA 
TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 

,5 TCyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATTATGAG 
TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTGTAT CAGAATCTAT 
CGTTCGTCAT GCGCCATGTG ACGTGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 

20 CCAACCACAA GTT 

(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 
2s LENGTH: 6060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 
TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 
3^ CAATAATATA AATCATGTTA TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 
TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATCATATTA CAGTTACAAA 
TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA AAAAACTAAC 
TGATCGTGCT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 
TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 
AAATAAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATCA 
TAGCCCtAtT GATTAAGTAC CACGGtCAAC ATATCACTAT ACACGACATT GCTCAGCAAC 
TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 
CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 
CTGATTTAAA CGATTTGAAG CAATCGATTG CACAACATCA AACCATTGAC TTATCTGTTC 
AAGAGCAGAA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 
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AACTGGTGCT GGCATAGGAG AAGAATTATA TGATGACATA TTGCATGAAG AAGTATTTTT 4 80 

TAAAGAAGAC GAAACGCCAT CAAATGCTGA TTTTTGTATT TTAGTTAATG GTGATTCAAT 540 

GGAACCTATG TTAAAACAAG GAACATACGC TTTTATTAAG AAAGAAGATT CTATTAAAGA 600 

TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG €60 

CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT GATGATATCA AAGTCGCTTC 720 

GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 780 

CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 840 

TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 900 

ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC 942 
(2) INFORMATION FOR SEQ ID NO: 532: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 2: 

TGGnAAATGC AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 60 

GGGTAAAGAA TGGTTATTTT AAACCGTCAG AAGATAAATC AAAAGAAACA TATACAATTG 120 

TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 180 

TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 240 

GTATGGATCA TGCTGGTATT GCGACACAGG CAAAGGTAGA AGCTAAATTA AATGAACAAG 300 

GAATAACTAG ATATGATCTT GGTCGTGAAA AGTTTTTAGA ACAGGCATGG GATTGGAAAG 360 

AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 417 
(2) INFORMATION FOR SEQ ID NO: 533: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 3 base pairs 
^5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 
GATCCTGAAC CCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 60 
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AAACTTGTTA GGGTTTGAAT TACAACCAAC TAATCATGGA TTCATTATTC ATCCGTCAGA " 4S2( 

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TCATGCTTCC 4g8C 

AGTTCCTTCT CTACTTTCAA GCGAGCCTCT CAAAATCAAA CAATrTCATC CTCTAAATCT 474c 

ATCATTTCCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT 480C 

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTACAA AAACTAGAAA ArTTAGACTC 4860 

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC 4920 

ACriTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4980 

ATATCCAGAC GAAAGTGAAT TGCTGATTTA TTTTATTGAA GGTTTAATGT CTGAAAATCA 5040 

AACTQACX3AA GCGTTAGAAT ATTTAAGTTA TGTTCAACCA TCACCTGAAA AG1TCATX3TT 5100 

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 5160 

20 AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATrGGCTG AAATCTTATA 522 0 

TTATGATGGT CAATATTTAC GTGCTACCTC TCAATACGAA ACCGTTTTAG AAACTGGTGA 5280 

ATATCAAGTT AATGGTGTAA ACTTATTCTC TCGTATGGCA GATTGTAGrr TACAAAGTCG 5340 

kAACTATAGT GATkCcGATt C9CTTATACG ATGrAATTAA TGAAGATGAA ATCACTTCAG 5400 

AAGATTATCT CAAAAGAGCC ATTTCTnACG ATAAAAATGA CATCACTCAA GAAGCAAITA 5460 

AAATAATGAC TACATTACTT TCTAAAGATC CTGATTATAT TCAAgGCTAC TTCTATTTAC 5520 
aATCaTTATA TG 
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(2) INFORMATION FOR SEQ ID NO: 531: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOUX5Y: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 
AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATCGATAAAA AAGAATTAGC 
GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 
ACTTGGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 
AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 
TTTGTTTCCT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 
ACCTCCCCGC CAAGAAAACG TACTTAACTA CGCAAATGAG CAATTAGATG AACAGAATAA 
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ACGAGGCGTG CACTTTATAC AAGTCCCAAC GACTATACTA GCGCATGATT CTAGTGTTGG 2820 

CGGTAAAGTG GGTATTAACT CAAAGCAAGG TAAAAACCTT ATCGGTGCAT TTTATCGTCC 2880 

AACTGCTGTG ATTTATGATT TAGTCTTTTT AAAGACGTTA CCATTTGAGC AAATATTAAG 294 0 

TGGCTATGCA GAAGTTTATA AGCATGCGTT ATTGAATGGT GAATCAGCGA CGCAAGATAT 3000 

CGAACAGCAC TTTAAAGATA GAGAGATATT ACAGTCATTA AATGGTATGG ATAAATATAT 306 0 

TGCTAAAGGT ATTGAAACGA AGCTGGATAT TGTTATTGCA GATGAAAAAG AACAAGGTGT 312 0 

ACGTAAATTT TTAAATTTAG GTCATACATT TGGTCATGCT GTTGAATACT ATCATAAAAT 318 0 

ACCTCATGGT CATGCAGTGA TGGTTGGCAT TATCTATCAA TTTATAGTTG CXy^TGCTTT 324 0 

GTTTGATTCT AAGCATGATA TTAATCATTA TATTCAATAT TTAATACAAC TCGGCTATCC 3300 

TTTAGACATG ATAACTGACT TGGATTTTGA AACGTTATAC CAATATATGC TAAGTGATAA 3360 

AAAGAATGAT AAGCAAGGTG TACAAATGGT CTTGATTAGA CAATTTGGAG ATATCGTTGT 3420 

ACAACATGTT GATCAACTAA CATTACAACA TGCATGTGAA CAATTAAAAA CATATTTTAA 3480 

GTAGGTGAAT GAAATGGTAA ATGAACAAAT CATTGATATT TCAGGTCCGT TAAAGGGCGA 354 0 

25 AATAGAAGTG CCGGGCGATA AGTCAATGAC ACACCGTGCA ATCATGTTGG CGTCGCTAGC 3600 

TGAAGGTGTA TCTACTATAT ATAAGCCACT ACTTGGCGAA GATTGTCGTC GTACGATGGA 3660 

CATTTTCCGA CTGTTAGGTG TAGAAATCAA AGAAGATGAT GAAAAATTAG TTGTGACTTC 3720 

CCCAGGATAT CAATCTTTTA ACACGCCACA TCAAGTATTG TATACAGGTA ATTCTGGTAC 3780 

GACAACACGA TTATTGGCAG GTTTGTTAAG TGGTTTAGGT ATTGAAAGTG TTTTGTCTGG 3 84 0 

CGATGTTTCA ATTGGTAAAA GGCCAATGGA TCGTGTCTTG AGACCATTGA AACTTATGGA 3900 

TGCGAATATT GAAGGTATTG AAGATAATTA TACACCATTA ATTATTAAGC CATCTGTCAT 3960 

AAAAGGTATA AATTATCAAA TGGAAGTTGC AAGTGCACAA GTAAAAAGTG CCATTTTATT 4 020 

TGCAAGTTTG TTTTCTAAGG AACCGACCAT CATTAAAGAA TTAGATGTAA GTCGAAATCA 4080 

TACTGAGACG ATGTTCAAAC ATTTTAATAT TCCAATTGAA GCAGAAGGGT TATCAATTAA 414 0 

TACAACCCCT GAAGCAATTC GATACATTAA ACCTGCAGAT TTTCATGTTC CTGGCGATAT 4200 

45 TTCATCTGCA GCGTTCTTTA TTGTTGCAGC ACTTATCACA CCAGGAAGTG ATGTAACAAT 4260 

TCATAATGTT GGAATCAATC CAACACGTTC AGGTATTATT GATATTGTTG AAAAAATGGG 4 320 

CGGTAATATC CAACTTTTCA ATCAAACAAC TGGTGCTGAA CCTACTGCTT CTATTCGTAT 4 380 

TCAATACACA CCAATGCTTC AACCAATAAC AATCGAAGGA GAATTAGTTC CAAAAGCAAT 4440 

TGATGAACTG CCTGTAATAG CATTACTTTG TACACAAGCA GTTGGCACGA GTACAATTAA 4 500 
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GATACGATTA TATTAAAACG GCTAATCATT TTTAATTAAT GATTATATGA TGCAACTGTT 


1020 




TAGAAATTCA TGATACTTTT CTACAGACX5A ATATATTATA ATTAATTTTA GrrCGriTAA 


lOBO 


5 


TATTAAGATA ATTCTGACAT TTAAAATGAG ATGTCATCCA TTTTCTTAAT TGAGCTTOAA 


1140 




AACAAACATT TATGAATGCA CAATGAATAT GATAAGATTA ACAACATATT ATAATGTTAT 


1200 


10 


CGTGGAAGTA TGAAAGGAGC GAGTGTGTAT GAGATACCTA ACATCAGGAG AATCACATGG 


1260 




ACCTCAATTA ACAGTTATTG TTGAAGGTGT ACCTCCAAAT ATAGAAATTA AGGTTGAGGA 


1320 




TATTAATAAA GAAATGTTTA AGCGTCAAGG CGGTTACGGA CGTGGACGTC GTATGCAAAT 


1380 


IS 


TGAGAAAGAT ACAGTAGAAA TAGTATCAGG CGTTAGAAAT GGTTATACAT TAGGTAGTCC 


1440 




AATTACTATG GTTGTAACCA ATGATGACTT TACXK:ATTGG AGAAAAATTA TGGGAGCAGC 


1500 




TCCAATAAGT GAAGAAGAAC GTGAAAATAT GAAACGTACT ATTACAAAAC CAAGACCTGG 


1560 


20 


tcatgcagat ttggttggag gtatgaaata taatcatcgt gatttacgaa atgtgctaga 


1620 




GCGATCATCT GCTAGAGAAA CAGCAGCTCG AGTTGCAGTC GGTGCCTTAT GTAAAGTGTT 


1680 




ATTACAACAG TTAGATATCG ATATATACAG TCGTGTTGTT GAAATAGGTG GAATTAAAGA 


1740 


2S 


TAAAGATTTT TATGATTCAG AAACATTTAA AGCAAATCTT GATCGTAATG ATGTTCGTGT 


1800 




AATTGATGAC AGTATCGCAC AAGCAATGCG AGATAAAATT GACGAaGCTA AAAATGAAGG 


1860^ 


30 


AGATTCAATT GGCGGTGTCG TTCAAGTTGT AGTTGAAAAT ATGCCTGTTG GTGTAGGTAG 


1920' 


TTATGTGCAT TATGATCGTA AGTTAGATGG TAAGATTGCA CAAGGTGTTG TCAGCATAAA 


1980 




TGCITITAAA GGTGTAAGCT TTGGTGAAGG ATTTAAAGCA GCTGAAAAGC CAGGTAGTGA 


2040 


35 


GATTCAAGAT GAAATTCTAT ATAATAGTGA AATTGGTTAT TATCGTGGAT CTAATCACTT 


2100 




AGGTGGTTTA GAAGGCGGTA TGTCAAATGG AATGCCAATT ATCX3TTAATG GTGTAATGAA 


2160 




ACCAATTCCA ACGTTATATA AACCATTAAA TTCAGTAGAC ATTAATACTA AAGAAGACTT 


2220 


40 


TAAAGCAACA ATTGAACGTT CTGATAGTTG TGCTGTTCCT GCAGCAAGTA TCGTCTGCGA 


2280 




ACATGTCGTA GCATTTGAAA TAGCAAAAGC ATTATTGGAA GAATTCCAAT CAAATCATAT 


2340 




TGAGCAACTT AAACAACAAA TTATTGAGCG CAGACAATTA AATATTGAGT TTTAACAACA 


2400 


45 


AGAACAATTG AGGTGTAATC ATGAAATTAC AAACAACATA CCCTTCAAAT AATTATCCAA 


2460 




TATATGTTGA ACACGGTGCA ATTGACCATA TTAGCACGTA TATTGATCAG TTTGATCAAA 


2520 


SO 


GTTTTATATT AATTGACGAG CATGTAAATC AATATTTTGC TGATAAATTT GATGATATTT 


2580 




TATCATATGA AAATGTACAT AAAGTTATTA TTCCAGCTCG TGAAAAGACG AAAACAnTG 


2640 




AGCAATATCA AGAAACATTA GAGTATATTT TATCCCATCA TGTAACTCGT AATACAGCAA 


2700 
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ATCTTATTAT GTATGTTGTG TTTGCGCTTG TCGCGGTAAC GCAATTTAAT ATTGAAATTA 2700 

ATGCGAAAGT TAAATATCCA ACTCATTTAG ATATTACTGG TAAAAAGAAT AAAGCAAGAT 2760 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 2820 

GGGAGTCTAC AATCGCTTCA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2880 

TTTTATCGAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACCGATTC 2940 

TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 3 000 

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA ACTTTACAAT ATTTGyTGTC GGTATGATTA 3060 

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCCA ACTAT 3105 
(2) INFORMATION FOR SEQ ID NO: 530: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5532 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 

TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 60 

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTACCAATG GAACTTGCTG 120 

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTTATAA TGATTTAATT TCATTTATTA l8 0 

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 24 0 

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG ATCAATTAGA GGTGATTTAG 300 

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAGAGTCT GCTGAACGTG 360 

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 420 

GGTTATATGA ATAAAATATA AACTGTAAAC CTTTACGATT TATTTATAAA GGTAGAAAGG 480 

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTTGTA 54 0 

GTAAATTACT TGAAAAATTT TATAGTTTTT kGGTAACACG TAtTaAAAaG AGAGGAATAT 600 

45 TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA MKATGPJ^gPJ^ TATTATTTCG 660 

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 72 0 

TTAGATAAAC CACTTAATCC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 780 

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 84 0 

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 900 
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TATATGATGT 


TCAATATTTT 


CATGTCCATP 


'1 "1 *l''l*R*T**P1S Ti TV 

J- i- 1 I A i i/AAA 


CjAAACATGAT 


TGAAATCCTT 


900 




AACTTTATCA 


CGTGTATTTT 


CCAAAGCTAA 




T^i^r* ft ft ft /~<r^ n #T« 


AAACATGTCC 


960 


5 


TTCTGGTACT 


TGTTCGGCTA 


AAAATAAAGT 


GTP ATTYSrTY^ 


X XA(.(.V:y(.»AAG 


TTGCGTCTAC 


1020 




AACAATACTT 


TCTGGTGTTA 


TATGTTGTTT 


A AT A 21 PTY^TT 
X >xnVj X V7 X X 


n^T'«p^ ft ft ft ft ft ^ 
X X XVjAAAAAG 


GGAGTATACG 


1080 


10 


TTCTAATTTC 


ATTGCTTCAC 


CTTAGACTTG 


TAA PG PAAG P 


Z***! •« I ft 1* ft ft ^ ft 

^ X X V7 A X AACj A 


ATTT CTACGT 


1140 


GCTAATTCAG 


CATCGATGCC 


ATTTAAT A CT 


TPPPATTTaT 
x^V..l._f^X X X/\X 


T'ft ft r^R r^*T»^^ft 
X AAL-AL. rCCA 


CATTGGACCT 


1200 




ACCATGATAT 


CTATTGGACC 


AT CAC CGGTA 


ATTP'i'^nTV^ TV h 
X X l\jt/v\ 


v-vjAa (JATTTC 


AGGGGGAATC 


1260 


IS 


ACTTCTAATT 


GGTCCACAAC 






VjACjTCATAAA 


AGTTAATAAA 


1320 




CCTTTATCGT 


ATTGTTTTAC 


CATCGGTfSTA 


X X X 1 /\/i^.A 


ft 3* T'f^ ft ft / » n 

AA X CvAAGTAA 


ATGAATTTTA 


1380 




ATACCTTGTA 


CATCCATTTG 




T*/~*J^ (^T^ TV r^/^ 

X wiOLZAvy X AGC 


CATCATCATG 


TCATAGTCTT 


1440 


20 


CGCCAGGTAA 


GCCATTAATG 


A Tr; Tf^TY^ X Zi 
nxwxoxvji 


AXAvJAi XxjAX 


ATTATGCTTA 


CGTAATTTTG 


1500 




CCACACCATC 


ATAATAAGTT 


TTCAT ATP A T 

V— Xrt X \_nX 


v7Vj*jV_>\vAj A X X 


UA X r AAATCA 


GATGTTGACT 


1560 




GATGGATTGT 


TTGT AGTCC T 


AATTPAAPPP 


AX AAv^XAXCyx 


TCX3 X XGATTC 


AAATCTGCTA 


1620 


25 


AATATTCGAC 


AACATCGTCT 


f^RXAriAr'Af^T 

VJvJ X rWjJ-\\^,J-\\j X 


C^(.3\jA^uCVj X 


ACCAATAGAT 


AATCCCACAA 


1680 




CACCCGGTTC 


TTTAAGTACA 


Gr5TT*P(^ 21 2i 
VJVJ X X X X 


TTTCTTTTAA 


TACTTCAACC 


GGTGCATGTG 


1740 


30 


TATTTGTAAA 


TGCCTGAAAA 


T AAflP A A T a T 


AX X X Xuv^X X V_ 


CjTCjCCAX X X C 


TCATGCATCT 


1800"- 


TTTCTTTAAT 


TTCTTTAAAT 




X X\3AMX X\j(_ 


ft ft *T»*n ft r"i 
Av- G ATT AC CT 


GCAAAGTCTC 


1860 




CGCTACCTGC 


AGCAGAACAA 


AATGT A P 21 TT* 


V.-/\l-ULA X Vj X\7V_ 


T*ft ft ^T*/^/^^* 

TACAGTG C CA 


TCGCGGTTAG 


1920^ 


35 


GACAGTCAAA 


CCCGCCATCC 


AATGCAAPTT 


Xi/UUIXAX XXX 


n •'1 '/""P/^r^ft ft ftn^ 
X XVjX V-k.-AAAT 


TTATTTTTTA 


1980 




AATGGTAATT 


CCATGTGTGA 


TAAPGTTTf^T 

x'v^^—VJX X X X 


1 X X l—rvwvivv^ 


^•Tft T**T«/^^ ft ft ft 

uXAX rUGAAA 


TGATTGCCCA 


2040 




TATGTCATTT 


TCCTTTCTAT 


AAAAAAAGAG 


TTPT A 21 f^T a 
X X^Xn/W^XA^ 


AGATTTTAAC 


ATATTTTAAT 


2100 


■40 


GTTATAGTGT 


TTATTATAGT 


TTGACAAAAA 


A<? A^S A f2 AriT! A 


& r^Ta Tv^ ft ft ft T* 
AL, X AXoAAAl 


ATGAATATAC 


2160 




CTAAATCAGT 


CTGGTGGCTA 


GTAATTGGPA 


TfV^Pf^TT A Zl 21 


X /\ X X AV. X Vivy X 


TCCAG X X X X X 


2220 




TGTGGCCTTT 


AAATACAATT 


T AT ATG AAA P 


21 &^ft ft t 

^^M.v7Arl^ X X \3\3 


AAAAAGTTTA 


ACTGTTGCTG 


2280 


45 


GTTTAGTGCT 


AATGATAAAT 


T CATTT! YSG PA 


xo^JX X/lX XljVk3 


AAACTTATTA 


GGTGGTTCAC 


2340 






ATTAGGTGGA 


TACAAGACGA 


TTTTAATTGG 


AACTTTCACT 


TGTCTTTGTA 


2400 


SO 


GTACAACGCT 


ACTTAATTTC 


TTTCACGGGT 


GGCCTTGGTA 


TGCTGTATGG 


CTTGTAATGT 


2460 


TAGGGTTTGG 


TGGCGGAATG 


ATTATTCCTG 


CGATATACGC 


TATGGCTGGA 


GCAGTGTGGC 


2520 




CAAATGGCGG 


AAGACAAACG 


TTTAATGCGA 


TATACTTAGC 


GCAAAATATT 


GGTGTGGCTG 


2580 
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TCAATCATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC 234 0 

AACCTGAAAA TGAAATATCT ACATGGATTT ATGCCTTATA TTCAGATGTA AACGGCGATT 2400 

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 2460 

ACTTAGGTGT ATCAACTGAC AA/ATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 252 0 

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGACCTT 2580 

TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC 264 0 

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 2700 

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 2760 

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 2820 

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 2870 
(2) INFORMATION FOR SEQ ID NO: 529: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3105 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 

2S <D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: 
CnGTTTAGCT GCAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 60 

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 120 

AAGAAATTTT A ATT TTACTA ACTTTATGGC AAAAAGGTTC TATGACGCTT AAAGAAATGG 180 

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 240 

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 3 00 

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA 360 

GTAGAGCAAC AGCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 420 

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 4 80 

^5 ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 540 

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTGCAAAAC 600 
TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 660 
TCCTTCGCTA TGACCATGAT ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 720 
TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 780 
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75 



20 



GCATTTATAG GAATATCCAT TATTTCAATA AGTAAAGATG ATTACACAAC TATTAATGTA 
TTAGGTGITT TTATTATrTT ACTTGCATCT ITTAGIX^ GTrrGTAITT TACTTTCCAG 
AAAAAATACA TAGAAAAATA CGGCITCATC GCTTTCACAC TATaTACAAT AATGGCAAGC 
TCACCATTTA TGCTTATTTT TATTCCIX3AA ATCATCAACG ATATACACGG CGCCACTTTT 
ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTrc 
CTTGCTTATA TTGTGAAGTC AGTTGGTCTC TCTCATGCAA CAATGTCTCT TTATTTAACA 
CCTATCG7TT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCITGCT 
ATTATAGGCG GAATTATCAC TCTACTAGGC GrrAGTTTAA GTAACrrCIT TCAAAATACA 
TAATTAtrrCC AAGTCCCGCA CCTCAGAAOC CAAAAACATT CGAGTGATAA AATTTTAAAA 
ATCAAAAATA TAAAAA-^AT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 
CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 
ATTACAGTTA TGGAAATTAT GAAGCATTTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 
ACAAATCCGC TTACTTAATC GGATCTGGTC TAGCTTCACT 1X3CTCCAGCT IX3TTTTTTAA 
TAAGGGATGG TCAAA-^GAA GGTTCGAAGA TTCATATTTT AGAAGAGTTA CCTAAAGCAG 
GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 
AAATGGAGAA CCACTTTGAA TGTITGTGGG ACTTATTCAG ATCTATCCCT TCATTAGAAA 
TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 
CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTITAGT CACAGACGGA GACTTCACTT 
TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT AACGAATGAA GAAGATTTAG 
ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT TAATTCAAAC TTTTGGATTT 
ACTGGAAAAC GATGTTTGCA TTTGAACCXST GGCATTCTGC AATGGAAATC CGTCGCTATC 
TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCrTTA AAATTCACTA 
AATATAATCA ATATGAATCT TTAGTATTAC CTAlXKJ-nTGA ATATTTAAAA TCGCATCGGG 
TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATCTTACG ACAAGTCAAA 
AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTCACTA 
TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATCGTGATA 
ATGATACACC AGCGCCACCA ACTGACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 
TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTCATAAGTT TTCCCAAAAT ATTCCTAAAA 
AAAGTTGGTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 



35 



40 



54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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GATTAAGTAT ATAGAGCACT ATTTTGTATT TGTTAATATT 
ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT 
TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC 
GAAATAATCA TTTACTTTTC TGAAAAATTA AATTAATATT 
AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT 
GCATTAACCT TGTTGAGCAC TTTATCACCA GCAGCATTAG 
CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA 
AAAGAATTGC CTAAATCCGA AAATGTAAAA AATATTTATC 
GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC 
CATGCACCTG aCAAAGAAGT GAAAGTACAC GCAGACAAAT 
AATGGGGATA CTGATGCGAA GAAAGTAAAG CCAACGAATA 
GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA 
aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG 
TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC 
GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA 
AAAGG 

(2) INFORMATION FOR SEQ ID NO: 52B : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



TTCACAAA/VA 
TATTTAGCTA 
AAAAAGTATA 
TTATTTATAA 
ATGCATTTAC 
CGATTGATTC 
GTGATGCGGT 
AAGATTACGC 
AACCGAGTGT 
CAGGAAAAGT 
AAGTGACATT 
AGAATAAAGC 
GTGACAGTAA 
ATTGGAAAGT 
AAGAAGCTGC 



TAAAGCCTTG 
CAGATAACAT 
GAAATAAATA 
GCTGTTTTTA 
AAGTATGGCA 
AAAAAATAAA 
CAAAGCATTA 
TGTTACTGAT 
TGATGGTGTT 
CGTTTTAATC 
AAGTAAAGAT 
GAAAAATCTT 
TAAATACGTT 
TAAAATTGAT 
AGAAACTGGT 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1?00 
1260 
1320 
1380 
1440 
1500 
1560 
1565 



40 



45 



50 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 
GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT 
AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC 
TCATATTTAT TTACTATCAT ATTGTGGGGA TCTGCATTTC CAATGATAAA 
AATCAT-TTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC 
TTCCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT 
TrTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT 
TTGATTAGTG CAGGTATAtC TGGrATTCyA GTCTCTACAA CGCCTATTTT 



TTTAATATAG 

TACTTTTCTA 

GATTGCGTTA 

AATAATTCTC 

CCCTGTTATT 

TGGTGAAaCT 

TTCTAGTGCT 



60 
120 
180 
240 
300 
360 
420 
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AGTTTTAATT TCAACATCAT TTGTraAGTA CATTTTTGCT AAAGCTITAA TTTCAGAGTT 
AGTTAAAT-TA TGCTrTGCAT TTTTACCTAC AATTTCAATC ACGTTATCAA GTTTATCAAT 

5 

AGAATCAACT TCCTGTGCTT TTTGGAATAA AATCTTAATT AATTCCATTT GACGTTGTCC 
ACGTTTTAAG TCTGAATCAT GATGTCTAGT TCTAGCAACT GCTAAAGCCT CATCACCATT 
TAATTTTTGG TACCCTTTTT TAATTTTAAT CTTACCAGTA TCATCTGTGT TAGGTTCATT 
TAAGTCGTAT GGCACATCAT AGTATATGCC ACCAAGCTCA TTTACAGCCT CGACAAATGC 
TTTCATATTG ACTCTCACAT AATAATCAAC AGGTACATTC ATCGTAGCTT CTACCGAATC 

;5 CATTGCGGCA ATTGGACCAC CATATGCATG TGCATGGGTA ATCTTATCGT AATAGCCAAC 
TTTAGGAATG TAGCTGATAG TATCACGTGG AATACTAAGC ATTCTAATTT GATGTnTGA 
TTGATTAAAA GTAGTTAAAA TCATAGCGTC TGATCTAGAG TGTTCAGCAT CCTGTCCTTT 

20 TTTTCTTCTT CCATCGTTAT CATCGATACC TAAGAAAAGA ATAGAGATAG GTTGTTCTTC 
GGGATGACTT TATTATCTCT TAAGTTGGAT TGACGTTAGC ATTTTTGCTG TCTTGAGAAG 



2S 



30 



40 



45 



A 

(2) INFORN5ATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1140 
1141 



35 SEQUENCE DESCRIPTION: SEQ ID NO: 527: 

GCACCATTAG CATTGGGCnC ACTGATTGGA GTAnCAGTTG TTGAAAATTC GGCGCCAACA 
AGTAAACmGG CmCaGGCaGC mATamCCCCa TATTATAcTT ATAATGGTTA TATTGGTAAT 
AATGCTAATT TTATTCTGGA TAAGAATTTT ATTAACGCGA TTAAGTATGA TAATX3TGAAA 
TTCAATGGTA TTAAATTAGC TAAAACGAAT ACGATAAAAA AAGTAGAGAA ATATGATCAA 
ACTTTTAAAG GTGTTTCTGC AAAAGGAAAC GAAGCAAGTC AATTGCAATT TGTAGTTAAA 
AATAATATTT CATTAAAAGA TATCCAAAAA GCTTATGGCA AGGACTTGAA AAAAGAAAAT 360 
GGTAAAACAA AGGAAGCTGA TAGCGGTATT TTTTACTATC AAAATGCTAA AAAGACATTA 420 
GGCATCTGGT TTGTCGTTGA TCATAATAGA GTTGTCGAAG TAACAGTTGG ACATACACCA 4 80 

TACAAAACAA GTAAATAAAA TAATGGCATA TTAAGGCTAG AGTGTGAGGA GTGATACCGC 
ATTCTAGTCT TTTTTATTAA ATAATAACGA TTATTGCGTC TTACATAGTT GTTTGAAATT 



60 
120 
180 
240 
300 



54 0 
600 
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GCACTAAACT GTATCCATTT GAATGAATGC CACTTGACGC 
CCTCTTTCAC TTCTGAACCA TCTACATAGT CATCCTTTTC 
CAGCTACATC ATATTCGCCT TCGTGATACA TTTCACCCAT 
TAAGTGCAGT ATTCGTTTCA ACACATGCAT CACTAATACC 
CTTCAGGAAC AACTTTGTTT GTAGCAATGT AATCTAAAAA 
TCGTTAAAAT ATCATTAACA CACATTGCGA CTGCATCGAT 
CATAGTCGAT AGCTAGTTTT AATTTCGTAC CTACTCCGTC 
GCGCTGtCAT ATTTAATTGT GATAAATCAA ATGTAGCACC 
TAACTTCTTT ACGCATCGTA CGtTTAACaw GaCtAGaCaT 
CAGCATGAAT ATTTACACCA GATTGTTCAT ATGCTTTAGA 
TCAAAT^GT GTTTGTTTTT AGAAATATAT TGCTTTTGTC 
TAATTTGCTT CATAATCATA TAACCCTGCA GGATAATCTC 
AAGCCACTAT ATGGCGCGTC ATAATCTAAA CCAATTGATT 
AGATATGCTA ATGAATCAGC GCCAATATAA TCTTTAATTT 
GAAATTAATT CTGCCGTAGT TGAAACATCG ATTCCGTAAA 
GGTGATGCTA TACGCACATG CACTTkATTT GCACCAGAAT 
CGTCGAATTG TCGTACCGCG nACAATGGAA TCATCAACAA 
(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



AAGCCCTATA 
AACTGCTCCA 
TTCAGCAGTC 
TTTAACAATT 
ATATAATGGT 
ACCTATCGAA 
TGTTCCAGAA 
GAATCCACCT 
TCtTTCtACA 
CATTTAAATT 
GATGACTTAA 
CAGTGAAACT 
CAATTAACCC 
CTTCAGGTGA 
AACTTGGAAA 
CkTTTAACAT 
GACTG 



ACAACTTGTC 
ACAGCAAATC 
TCTCCACCGA 
TGTTCAATAA 
TCTGCACCTG 
TCATGTTTAC 
ACTAAAACTG 
AAACCACCGA 
GCTTCATAAC 
TCCCTCGCTA 
ATGCGCTTTA 
TTCTACACAT 
ATCTACAGAT 
TTTGCTTGCA 
CATAAATTCC 
TTTCACAATG 



900 
960 
1020 

loeo 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1845 



40 



45 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 
TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT 
TGAAGCTTGT GGCGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT 
AGGTTGTTGT TGATTGCTGG AmtCACTGTT ATTAGTTGAm TCACTGTTCT 
CTTATCTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT 
TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT 
ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT 



TTTGGTTATT 
GATCTGTAGC 
CGTTAGATGT 
CTAAAGGCGT 
CATTTTTATC 
GAATGCTTTC 



60 
120 
180 
240 
300 
360 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 



£ 



10 



TTGTTGTCAG 


TTTAACGCAA 


CGTGTTACCG 


ACTTTTCAAG 


TACACAAcAA 


TATATATGTC 


bO 


TATTCCATTT 


TTAGCCCcTG 


CCATTTTCAT 


CATTGGTGGT 


ATTATGTTGT 


TTATTTCAAC 




ATTTAATAGC 


TTAGATGAAA 


CTGCTGAAAA 


TAATAACAAA ATAAAGAAAC 


TAATGATTAA 




AGGACTTATC 


ATTATTAACA 


TTTCATTTAT 


CGTTATGATG 


GTTTTAACAC 


CATATTGGTA 


240 


CTTGTATTTA 


ATCGTCTATC 


TTATTTTCTT 


GTTGTTTTTA 


TTGTGGCAAA 


AGGTTTATAA 


300 


ATTTTAATAC 


CAAAACTATT 


AAACACTTCT 


GATATTCTTA 


GTTCAAAATA 


TCAGAAGTGT 


360 


TTTATAGTGT 


TATCTAGTTC 


AGATAAATAT 


TTCCTTACTT 


AAAAAAACGC 


CCTCCTCTTA 


420 


TTTTGACCCC 


nAT 










433 


(2) INFORMATION FOR SEQ ID NO: 525: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 5 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 





(Xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


525: 






30 


CCCCCACTAT 


GATATGCTTG 


GCCTATTGCG 


TCAATCCCCT 


TATATTTCGG 


CAATAGAGAT 


60 




GGATGTATAT 


TCAATATTTT 


ACCTTCAAAT 


GAAGCTAATA 


AGTCTGGACC 


AATTAGACGC 


120 




ATGTAGCCAG 


CTAGAATAAT 


CCACTCTACC 


TTATCTTCAT 


TTAATAATGT 


TACTAAATGT 


180 


35 


TGTTCATACG 


CTGCTTTTGA 


ATCAAATTGT 


TTTGGTTCAT 


TAATATAAAC 


AGGAATATCG 


240 




TGCTTTTTTG 


CTCTATCTAT 


ACAAAACGCA 


TTTTGATGAT 


CCGTATATAG 


CGCCGTAACT 


300 




TCAATATTTT 


CAAGTTTTCC 


TGATTCAACA 


TGCTCAACTA TATTTkCAAA 


GTTACTTCCT 


360 


40 


GAACCTGATG 


CAAAAATCGC 


AATTTTAACC 


ATTGTTATAC 


CCCCAACAAT 


TCAATTGCAG 


420 




TTGACTCATT 


TTTCACAATA 


TGACCAATTT 


GATAGGCTTC 


CACATTTTGT 


TCTGCTAAAA 


480 




TCTTCAAAGC 


GCGTGATACA 


TCTTTTTCAT 


CAACGATAAC 


CGTATAGCCA ATACCCATAT 


54 0 


45 


TAAAAATGTT 


ATACATTTCA 


TTTGTGTCTA 


TATTGCCTTG 


TTGTTGTAAC 


CAATCAAATA 


600 




T'i"rrrGGTGT 


TGGAAATGAT 


GTAGTATCAA 


TTCTAGCAGC 


ATATCCGGCT 


GGCAATGCAC 


660 


50 


GTGGAATATT 


TTCATAAAAA 


CCTCCACCAG 


TAATATGATT 


CATTGCCTTA 


ATAGAAACTT 


720 


ci-rrriTCAA 


AGCAAGTACA 


GGTTTGACAT 


ATAATTTAGT 


TGGTTCTAAA 


AAGACATCGA 


780 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 

TTAGAAGTCA AATCATTAnT GGCGTCnTAT CGAGTATACT ATTAACTTCA ACTATTTTAG 60 

CAATTGCATA TATTTTAATG TGGTTTAACG GCCATATGAC aCTAACTTTG ACCTTAACGA 12 0 

CAATAATTAC AAGCTGTTTA ACCTTATTAA TATGTAGTAT TTTTATTAAT CCACTTATAC 180 

AAAAAATTAA GCAGTTTAAT ATAAAAACTA AGCAATTTGC TAACGGAAAT TACGCAAGCA 24 0 

ATGATAAAAC GTTTAATTCA CCAAAAGAAA TTTATGAATT AAATCAATCT TTTAATAAAA 300 

TGGCTTCTGA AATTACGCAA CAAATGAATC AAATTAAATC CGAACAACAA GAAAAAACAG 360 

AACTGATTCA AAACTTAGCC CATGATTTAA AAACACCTTT AGCAAGCATT ATTTCATATT 420 

CTGAAGGACr ACGTGATGGT ATAATCACTA AGGATCATGA GATTAAAGAG TCATACGACA 480 

TATTAATTAA ACAAGCAAAC AGATTATCAA CATTATTTGA TGATATGACT CATATTATCA 54 0 

CTTTAAATAC AGGTAAAACA TATCCCCCAG AATTAATACA ACTAGACCAA TTACTTGTAT 600 

CAATATTGCA ACCATATGAG CAACGTATCA AACATGAAAA CCGCACATTA GAAGTGAATT 660 

TCTGTAACGA AATTGATGCA TTTTATCAAT ATCGAACGCC ACTTGAGCGT ATTTTAACAA 720 

ACTTACTTGA TAATGCGCTA AAATTTTCAA ATGTTGGTAG TCGCATTGAT ATTAATATTA 780 

GTGAAAACGA AGATCAAGAT ACTATCGACA TTGCTATTAG CGATGAAGGT ATTGGCATTA 840 

TACCAGAACT ACAAGAACGT ATATTCGAAC GTACATTCAG AGTAGAAAAC TCTCGTAATA 900 

CAAAAACGGG TGGTTCTGGA TTAGGCTTAT ATATAGCTAA TGAACTCGCG CAACAAAATA 960 

ACGCAAAAAT CAGTGTAAGC AGTGATATAG ATGTAGGAAC TACGATGACT GTAACATTAC 1020 

ACAAATTAGA CATTACGTCA TAATCCGATT TATTTATAAA ATAAAATGCA AAGACTAAAA 1080 

AGAAGCTCCC ATTAATGAGG GCTTCTTTTT TTGTTTATTT AGAATAAACT TTATGGGTAT 1140 

CCTTCTCATC ATTTTCAAGA CTTGAAAGAT TTGTAGCTTG AATAATATAT TTAGGACGTG 1200 

CCTTAACTTC ATAATATATC CTGCCAATAT ATTCACCTAC AACACCAATT GAAATTAACT 1260 

GTATGCCGCC TAATAATAAA ATAGCTGCAA TCGTTGAAAA ATATCCCGGA ATATTAACAC 1320 

CAGATATCAT AATATTGATG GAATA7VATAG A 1351 

(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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TGTTAACAAT GGTGCGATTT ATATTAATGG TGAGAGACAA 
AGCACCAGAA GATAAAATTG ATGGCGAATT TACGATTATT 
CTTCATGGTT AACTATCAAT AAATATAATT GCATAGCTAA 
ATTCATTCCT AAGAATGTAA TGAGTAGGCT CTTAATGTAC 
TAAAGAAATC TCTATCGCCT TGGCCAGGTG ATTGACGTTC 
GTTGTTCTTT TTGTTGTTTT AATTTCACTT TAACTTCTTT 
CGGTAACAGT GACTGATTCA CCAGGTTTTT TATTTTCATA 
CATCATCTTT AATTTTCTTG CCATCAATTT CTGTAATAAT 
TATCACTATC AGCTTTGGCG ACATAAATAC CGTCTTCTCT 
GCTCTTCTTC AGGAATATCT TTCAAATTAA TTAAACCAAT 
TTTTACCATG TTTTACAAGT TGTTCAATTG TTACTTTAAC 
ACCCGATACC TTCAACTTGT GTCGCAGCAA TTTTCATTGA 
CATTAATATC TACTAATGCG CCACCTGAGT TACCTGGGTT 
GAACGCTAAC TTTTGTATTG CCACCAGTTG TCTCAGCGTC 
AAATGATACC AGATGTTACA GAGTTAGCAA ATTGTAATCC 
ATACGCTATC GCCAGTTTGT ACTTTTGAAG AGTTGGCAAA 
TTGTATTTTC AATTTTAAGT ACAGCAATAT CAGTTACTGC 
CrrrTAACTTG TTTTTTATTA TGTAATTGGa CTCTAATTTC 
GATTGTTTGT AACGATATAA GCTGAATTGT TGTTTATTTG 
CTCCAGCTTC AGATGGTTTA GATGATTTGC CyTTTAATAA 
TTtGCATGTk AATAACTCCA ACAATTGTAG GGGAGACAGA 
TACCGAATTT CTTGCTTTGA CCGTCTAATT GATTGCCACC 
CAGTTGAACC ATCTTTATTT AAAATTGTAC TATTTAATAC 
GAAGTGCACC AATAATTCCA GCAATCAATG CAACGATGAC 
TAGGTCTTCT GTATCTAGGT GTTTGGCTAT GGTTTGrTGT 
AATCTGACAT ACTTAACCTC CATTATATGA TTTATATATG 
TATAATT 

(2) INFORMATION FOR SEQ ID NO: 523: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 51 base pairs 



CAAGATGTTA 
CGTCGCGGTA 
ATAAATTAGA 
TTTTCTGTCT 
TGATCGGCTT 
TGTTTTACCA 
TAAATAGCTT 
ATCACCTTTT 
ATCAGTATGA 
ACCAATCGAA 
TTCATTACTT 
GTTAATACCA 
AATAGCAGCA 
AATCGTACGT 
TAATGGGTTA 
TTGAATCGCT 
ATCTTTACCA 
ATTTGCGCCA 
aTAGATAACA 
GTCGTCTACA 
TTTTATCATT 
TTTATTATTT 
TTTGCCTATA 
TGTnTTAAAC 
AGAATGATCT 
CTTTAATTAT 



ATTATGCTTT 
AGAAAAAATA 
GCCTACTCAT 
GTAAATTATC 
TGACGTTTTG 
TCACGGATAA 
CTTAAATCAA 
TTAAGATCAA 
AGTTGCTCGC 
GGGCGGTCAA 
GGAATAGCAA 
ACTAAATTAC 
TCTGTTTGAA 
TCGCTTGCTG 
CCCATTGCGA 
TTAATACCTT 
ACTAATTTCG 
TCAATAACAT 
CCTGAACCTA 
CTTGATGCTT 
TCATGAACGG 
GTTGTCTGAA 
CCAAGTACTA 
CATGGAAATT 
GTATGATTAA 
GTCTTTTnTT 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
144 0 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
1987 
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GTTAGGTAAA ACATTTTATC AAAACTATAG AGATGATGTT GAAAGTTTAT ATAGTAAGTT 420 

AGATTTAATT ATGGGATATA AAGATGAAGA AAGAGCAAAT AAAT^GCAG TTAACAAAAG 4 80 

GATCTTAGAA AATAAAAAAG AAGACTTAGA AACCATAATT GATGAATTTT TTAGTGATAT 54 0 

AGATAAAACA AGACCTAATA ATATTCCTGT TTTAGAAGAT GAAAAACAAG AAGAGAAAAA 60 0 

TCATAAAAAT ATGGCTCAAT TAAAATCTGA CACTGAAGCA GCAAAAAGTG ATGAATCAAA 66 0 

AAGAAGCAAG AGAAGTAAAA GAAGTTTAAA TACTCAAAAT CACAAACCTG CATCTCAAGA 72 0 

AGTTTCTGAA CAACAAAAAG CTGAATATGA TAAAAGAGCA GAAGAAAGAA AAGCGAGATT 780 

TTTGGATAAT CAAAAAATTA AGAAAACACC TGTAGTGTCA TTAGAATATG ATTTTGAGCA 84 0 

TAAACAACGT ATTGACAACG AAAACGACAA GAAACTTGTG GTTTCTGCAC CAACAAAGAA 900 

ACCAACATCA CCGACTACAT ATACTGAAAC AACGACACAG GTACCAATGC CTACAGTTGA 960 

GCGTCAAACT CAGCAACAAA TTATTTATAA TGCACCAAAA CAATTGGCTG GATTAAATGG 1020 

TGAAAGTCAT GATTTCACAA CAACGCATCA ATCACCAACA ACTTCAAATC ACACGCATAA 1080 

TAATGTTGTT GAATTTGAAG AAACGTCTGC TTTACCTGGT AGAAAATCAG GATCACTGGT 114 0 

25 TGGTATAAGT CAAATTGATT CTTCTCATCT AACTGAACGT GAGAAGCGTG TAATTAAGCG 1200 

TGAACACGTT AGAGAAGCTC AAAAGTTAGT TGATAATTAT AAAGATACAC ATAGTTATAA 1260 

AGACCGAATA AATGCACAAC AAAAAGTAT^ TACTTTAAGT GAAGGTCATC AAAAACGTTT 1320 

^ TAATAAACAA ATCAATAAAG TATATAATGG GCA 12 S3 

(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 60 

^5 ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 120 

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180 

AGCTCAAAAA ACATTAGCTG AAGAAGTAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 24 0 

50 TGATGCAATC CGTATTTCAC AAGCATTATT TAGTGGTGAT TTAAAATCAT TATCAGCGAA 300 

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 360 
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AATTTGATGA TGATATTTGC TTTTTATTTT CCAAATGGAA TTTACTTAAA CTGATCCATT 
AAAATATTAA TGAAGCACTA GAATACATAA ATGAATAGTa ATGGTGcACA GTATAGAATA 
ATTAAGGCTA TATTAAGTAT AAATATCGTT AACTGTAAGC TATCTTTAGT TTTAATATAA 
ACTATTAGGA TAATCGACGT AAGAAGAATC ATATATATTA ATGATGAAGA AGTCCATACA 
AAATCCGCAT CATTTGTTGT TAATAATGGG ACTATAATTA ATCCGAAATT AATCATGCAT 
GCTATATATA CTATAATGTT ATACACAATG TTAATTTTTG TTCACCACCT TATACTTCTA 
TTTTAAAAAC TTCTTTATAA TGATATATTG TTTAATGTTG AAATAATTAG ATTATCTAAT 
TTTCATTTGC TTTACATGTA AAAGGCTATA TATAGTATGC TCTTTATGAT TCTAAATGCT 
TTTTAATATT TAATGCTCAT CAACATTTGG ATTTTGAATA TTCAATTCAA AAACTTTATT 
AGCTACGTCA ATTGTAAAAT CAGAACCATA GTTGACATGA GCTACTTTTA ATTTTCCATC 
TAAATAATAG ATTGCGATTG CAACATCGTA AAATTCGTCA ATGACAAATA AACTCTTTTC 
GTTTGTTACA ACCTCATGCT CTCCTGAGTA TACAACGTTA ATTTCCCAAT CATTAAAAAC 
CATTTGTTAA CCTCCTTGAA CATTTAAATT GATTCAACTT AAGTTTAACT TATTCATACA 
25 ACTTCGTACA ATATCTAGAT GAACATTAAT TGTATTTCTA GAAATCTTTT TCAATTATAT 

GTACTAATTA TACTTTTAAA TTTCTTATTT CAGTATAGTT TTAAAACGAT TTTAAAATAA 
TTCTGCAAAT ATATTAACAC ATAATGTGTT CAAAAAGTTT TGAACAATTT CAAAACTlTr 
30 ATATAAAGGG nTTGACAACA TGGATTCAAA TnTCTTATTT TAAAAATTAC CTCATATAGT 

GTCATGTTAG CCAATTTTTA AG 
(2) INFORMATION FOR SEQ ID NO: 521: 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRTOTOEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 
. AGCTTGGaTG ATTTAATaTG GtCCtTTCCC AaCcTTAGAT AATGAAAGAT tTGATAATCC 60 



TGAATATAAA GAAGCTATGA AAAAATATcA ACAGAGATTT ATGGCTGmAG ATGAGGCTTT 
GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA AriTAGATTA 
so TCTAGGATTA TCTCATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 

GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC CCTGAATTGA AAGACTTTAA 



120 
180 
240 
300 
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TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 1200 

AAAGGCAGTC AACTTAGTTT CGTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 1260 

AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATCGTT GCTGTTATCT 1320 

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 1380 

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 1440 

AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 1500 

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAGCAGAT GCAAGTAGTC TGGACAAATA 1560 

TTTAACTGAa aGTCAGTTTC ATGATAAACG CATAGCAGAA GAATTAAGAA CTTTACTTAA 1620 

CAAATCGAAT GTATATGCAT TAGCTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 1680 

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 174 0 

TGATGCTAAA GTTGCATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 1800 

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAACCTG TTTTCAATTT GCATATGTGA 18 60 

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGAGACAAA 192 0 

TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980 

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 2 04 0 

TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 2100 

AAGTTTTTTC AATACTCGTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 2160 

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 2220 

TGCAAGTTGG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 2280 

AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATGCAAGT 234 0 

TGGCGGGGGC CCCAACATAG AAGCTGGCGA AAAGTCAGCT TACAAA/UVTG TGCAAGTTGG 2400 

CGGGGCCCCA ACATAGAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 2460 

GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 2520 

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAGCTA ATGATAAAGA ACCACTACAT 2580 

AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 264 0 

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 2700 

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT TTTCTTTGTT GAACCTTTTT 276 0 

SO ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2820 

TTATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT APiAGATP^TPiA 288 0 
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TAGTAGCTCC AGGATTACAG ATTAATTCGT CTTCAATTAA AGGACTAAAA GAAAATATAG 
GTAAAAATGG TGTTTGCTCT AACTATTCAC CTGACTATGT TAACgAAACT TGGAACCAAA 

rrrcTAATrr taaacaagga aatgccattt ttacgcatcc aaacactcct ataaagtctg 

GAGGTGCgCC TATGAAAATT ATGTATTTAG CTGAAGATTA TTTTAGGAAA CATAAAATCC 



2940 

3000 

3060 

3120 
GT 

10 3122 



(2) INFORMATION FOR SEQ ID NO: 52 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 982 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



^° SEQUENCE DESCRIPTION: SEQ ID NO: 520: 

ATAnAGATAT AnATnAATAT ATTGAGGTCA AACGATGATA ATTAAAATTT TAACAATTCT 
ATTACTACTT TGTATATTGA GCTATTTGGT TACAAATAGA AAGAAGCCTT TTCTGTTCTT 
AAAGACACTC TTTATGGGTG TGGTATTTAT CTTTATAGGA TATATTTCAC TGGCAATATC 
TGCCGTAATT ATTTATGGTA TTATTCAATT TATCACAATT GATTTTSGTA GTTTTTTCTT 
AATGGGTATT ATATTGATCT TGATTTCAAG TATATTCCAA TTATTTATAG TTAGATTACT 
TTTTAGAAAA AAGAATGTCG ATTTGACAGA GGTTGTCGTT TTAsAsCATT TAATTCAATG 
GTTCTTAGTT TACTTTGCGA TCTATCAAGC AGTAAATGAA AAAATCGACA TTAATGATAT 
^ TAATATCGAC AATTTCCAAT CTGTCTTTTT TGACGTCTCT AATTTGAATT TAGTAATTCT 

ACCAACGTTA ATCATTAGCT GGGTCACAAT ATTTAACTAT AGAATCAGAA GITACAAATA 
AAATCTATGA GATTATACCT TCAGACACCA ACATTCAAAT GOTCTen-TT TTGrTOTCTG 
40 GTTTTATTTT TGAAATTCGA PAPAGTAGAG GCATGAATTT TTTGACTAGT GTATAAGTGC 

TGATGAGTCA CAAGATAGAT AGCTATATTT TGTCTATATT ATAAAGTGTT TATAGTTAAT 
TAATAATTAG TTAATTTCAA AAGTTGTATA AATAGGATAA CTTAATAAAT GTAAGATAAT 
AATTTGGAGG ATAATTAACA TGAAAAATAA ATTCATAGCA AAATCTTTAT TAACATTAGC 
GGCAATAGGT ATTACTACAA CTACAATTGC GTCAACAGCA GATGCGAGCG AAGGATACGG 
TCCAAGAGAA AAGAAACCAG TGAGTATTAA TCACAATATC GTAGAGTACA ATGATGGTAC 
TTTTAAATAT CAATCTAGAC CAAAATTTAA CTCAACACCT AAATATATTA AATTCAAACA 
TGACTATAAT ATTTTAGAAT TTAACGATGG TACATTCGAA TATGGTGCAC GTCCACAATT 
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TCATGGGTAT TCAGAAAGAA GAACTTAGAG ATGAAGTTGA GTACGGTGGT GTAGGCACTT 
ATATTGGTGC TACTGAAAAT GCGAATCATA ATTTATTTAT CTAATTAAAT CTATTAATAA 
AAGGAGTTGT TATCATGTTT TTTAAACAGT TTTACGATAA TCATTTATCT CAAGCATCAT 
ATTTAGTGGG TTGTCAACGT ACAGGAGAGG CAATAATAAT AGACCCTGTT CGTGATTTAT 
CGAAATATAT AGAAGTTGCA GATTCTGAAG GTTTAACAAT TACACAAGCT ACAGAAACAC 
ATATTCATGC TGATTTTGCT TCAGGAATTC GTGATGTGGC TAAACGCTTA AATGCAAATA 
TATATGTGTC TGGCGAAGGT GAAGATGCAT TAGGGTATAA AAATATGCCA TCAAAAACAC 
AATTTGTTAA ACATGGAGAT ATCATTCAAG TAGGCAATGT TAAATTAGAA GTTCTGCATA 
CTCCAGGACA CACGCCTGAA AGTATTAGCT TTTTACTCAC TGATTTAGGT GGTGGTTCAA 
GTGTTCCGAT GGGATTATTT AGTGGTGACT TTATTTTTGT TGGTGATATA GGTAGACCTG 
ATTTAtTAGA AAAATCTGTT CaAATAAAgG GTTCTACAGA AATTAGCGCG AAACAAATGT 
ATGAGTCCGT TCAAAATATT AAAAATTTAC CAGACTATGT TCAAATCTGG CCGGGTCATG 
GTGCTGGAAG CCCTTGTGGT AAAGCATTAG GTGCCATACC TATATCTACA ATAGGTTATG 
AGAAAATTAA TAACTGGGCA TTTAATGAAA TTGATGAGAC TAAATTTATT GAATCATTAA 
CATCAAATCA ACCAGCACCA CCGCATCATT TTGCACAAAT GAAACAAGTT AATCAGTTTG 
GTATGAATTT ATATCAATCA TATGATGTTT ATCCTAGTTT AGATAATAAG AGAGTAGCAT 
TTGATCTTCG TAGCAAAGAG GCCTTTCACG GTGGCCACAC AAAAGGAACA ATCAATATAC 
CATACAACAA AAACTTTATT AATCAAATTG GTTGGTACTT AGATTTTGAA AAAGATATAG 
ATCTAATTGG AGATAAATCT ACTGTTGAGA AAGCGAAACA CACTTTACAA TTAATTGGGT 
TTGATAAGGT AGCAGGCTAT CGTTTGCCAA AATCAGGCAT TTCAACCCAG TCCGTTCATA 
GCGCTGATAT GACAGGTAAA GAAGAACATG TATTAGACGT ACGTAATGAT GAAGAGTGGA 
ATAATGGACA CTTAGATCAA GCAGTTAATA TTCCGCATGG TAAATTATTA AATGAAAATA 
TTCCTTTTAA TAAAGAGGAT AAAATATATG TACATTGTCA GTCAGGTGTT AGAAGTTCAA 
TTGCAGTGGG TATATTGGAA AGCAAAGGTT TTGAAAATGT GGTGAATATT AGAGAAGGCT 
ATCAAGATTT TCCAGAATCA TTAAAATAAT TTAAGGATGT GGAAAAAATG AATAAGCATT 
ATCAAATTGT TATTATTGGT GGCGGTACAG CAGGTGTTAC CGTAGCATCA AGACTATTAA 
GAAAAAATCA AAACTTAAAA GAGAAAATAG CAATTATAGA TCCAGCAGAC CATCATTACT 
ATCAACCATT ATGGACGTTG GTTGGTGCAG GGGTATCTAG TTTGAAAAGT TCTCGTAAAG 
ATATGGAAAG TGTTATACCT GAAGGTGCTA ACTGGATAAA ACAGGCTGTT TCAAGTTTTC 
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ACAiiiinA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 
TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAATAGTA AGTCAGATAT ATTGTTTATr 
ATTGAAGGTG TATTGaTGTA TTTTGATGAG AGTGTAATGA CTCAATTATT GGACACTATT 
ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 
GCGAATAATA CmAAGAGACA TCAATCGGTA TCCAAGTTAT CCTCACCACC TGmTTAAA 84 0 

TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C 
(2) INFORMATION FOR SEQ ID NO: 519: 

IS SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 
TGAATaaAAA TATATTAATA GATAAACACA AATGTGTCcA AATACCCCTA GAGGTATTTG 
ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGGGAG GTTATTATGA 
AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC AGAATTGGAA AAACTAGCCA 
AGCAAGGGCA ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA CATATCAATG 
GTTCCATACT TCATCCTGTT GATGAGArTC AGTCATTCAA TAAAGAAAAA AATAAAACCT 
ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAGCTAAAC 
AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATGAAGAA GAAAACGATA 
GTTATGATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGCAGATCGT AAACAATTTA 
ACTATCGTGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 
40 TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTCACATTA 

AAAGTTGGGT GAAACAAACA AGGCATACTT TAGTTAAGCT TGATGAAAAT AACAATGGAA 
TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAGATTTAGA TATAAATTAT TCTGCTAAAG 
GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 
CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGGGCITA 
ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 
ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATCrTTGGTT 
TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 
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AACACATTCT TGGATTCCTA AGCATAAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 3720 

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 378 0 

TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3840 

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3900 

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 3960 

AGATGGGAAA ATAATATTTA ATAATAATCC TTTAATGAAA TGGTATATCA ATAATGTTCA 4 02 0 

GCTGAAACTA GACAGAAATG GGAACTGGCT GCCATCTAAA CAAAGCAGAT ATCGTAAAAT 4 080 

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 414 0 

CAAGGGTGAA GGAAACATAG mATTTATTAG TATTArAGAT ATAATGCGTT AAGGAGGTGA 4200 

ATGTTATCGC AAAAGAGAAT ATTGTCACAC GCATAAAGAA AAAATTGATA GACAATTGGA 426 0 

TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 4320 

GTGTAATCAA TAATACGCTT GAAACTAATG AAACGATATT TTCAGCTATT ACnAAGTTAT 43 80 

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTATGAAGA TTATAAAGTA GTTAATACAG 4440 

AAGTATCTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4 500 

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATGCATA TGTG 4 544 
(2) INFORMATION FOR SEQ ID NO: 518: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 60 

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 120 

TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180 

^5 TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTCCTTTG 240 

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 300 

AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 360 

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCAAGGAT 4 20 

AATAAAGATT TAATCGTGGT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 4 80 
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AGATAGCATT AGATAGGGAT AATCATCTTT GTCAAATGTG TTTACGTGAA GACATAGTAA 192( 

CAGATGCAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 198C 

ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATCATA 204 C 

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 2X0C 

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGCC 216C 

CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAACAAAA 22 2 C 

AAACAGCTGA AAGAATATAT AGAGGATTAT AAAAAATCTG ATGACATATT AATTAATTTG 2280 

TATATAGAAA CGTATGAATT TTATTGTCGG TTAAGAGATG AACTTAAAAA TAGTGATTTG 234 0 

ATGATAGAGC ATACAAACAA GGCTGGTGCG AGCAATATTG TTAAGAATCC ATTAAGCATA 24 00 

GAACTGACAA AAACAGTTCA AACACTAAAT AACTTACTCA AGTCTATGGG TTTAACTGCA 2460 

20 GCACAAAGAA AAaAGATAGT TCAAGAAGAA GGTGGATTCX; GTGACTATTA AAGTTTTAAA 2520 

TGAACCTTCA CCAAAACTAT TAACAACATG GTATGCAGAG CAAGTCACTC AAGGGAAAAT 2580 

AAAAACAAGC AAATATGTTA AAAAAGAATG TGAGAGACAC CTTAGATATC TAGAAAATGG 2640 

AGGTAAATGG GTATTTGATG AAGAATTAGC GCACCGTCCT ATTCGATTCA TAGAAAAGTT 2700 

TTGTAAACCT TCCAAAGGAT CTAAACGTCA ACTTGTATTA CAACCATGGC AACATTTTAT 2760 

TATTGGCAGT TTGTTTGGTT GGGTTCATAA AGAAACAAAA CTGCGCAGGT TTAmAGAAGC 2820 

TTTGATATTT ATGGGGCGAA AAAATGGTAA AACAACTACT ATATCTGGTG TTGCTAACTA 2880 

TGCTGTTTCT CAAGATGGAG AAAACGGCGC TGAAATCCAT CTTTTAGCAA ACGTAATGAA 2940 

^ ACAAGCTAGG ATTCTATTTG ATGAATCTAA GGCGATGATT AAAGCTAGCC CAAAGCTTAG 3000 

AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3060 

GCCACAGGCT TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 3120 

40 TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGCGGC 3180 

AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 324 0 

TGTTAATATG GTAGAAGCGG GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 3300 

4S TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3360 

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 3420 

AAAGGCTAAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 3480 

SO 

TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 3540 

TATTTCCTTA GATGAGTTGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3600 



25 



30 
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GGAACACCTA 


AAGAAAGAAA 


TGATGCATTA 


AACACAGAGG 


CTGATATCTA 


TGTAACCAAT 


120 




AAAGAAAATA 


CTAAATGGTT 


ATGCGATCAA 


TATAAAAAAG 


AATGGCCATT 


TGATATGGTT 


180 


5 


GTAATTGATG 


AACTGTCTAC 


ATTTAAAAGT 


CCTAAGAGTC 


AAAGGTTTAA 


ATCTATTAAA 


240 




AAGAAATTAC 


CACTCATTAA 


TAGATTTATA 


GGATTAACAG 


GAACACCTAG 


CCCAAATAGT 


300 


10 


TTACAGGATT 


TATGGGCTCA 


AGTTTATTTG 


ATAGACAGAG 


GCGAAAGACT 


TGAGTCTTCA 


360 


TTCAGTCGTT 


ATCGAGAAAG 


GTACTTTAAA 


CCAACACATC 


AAGTTAGCGA 


AGATGTTTTT 


420 




AACTGGGAGC 


TAAGAGACGG 


ATCTGAAGAA 


AAGATATATG 


AACGAATAGA 


AGATATATGT 


480 


IS 


TTAAGCATGA AAGCGAAAGA 


TTATCTAGAT 


ATGCCTGACA 


GAGTTGATAC 


TAAACAAACA 


540 




GTAGTCTTAT 


CTGAAAAAGA 


AAGAAAAGTA 


TATGCAGAAT 


TAGAAAAAAA 


CTATATTTTA 


600 




GAATCGGAAG 


AAGAAGGAAC 


AGTTGTAGCT 


CAGAATGGGG 


CATCATTAAG 


TCAAAAACTA 


660 


20 


CTTCAACTAT 


CTAACX5GTGC 


AGTTTATACA 


GATGATGAAG 


ATGTAAGACT 


TATACATGAT 


720 




AAGAAGTTAG 


ATAAGTTAGA 


GGAAATTATA 


GAGGAGTCTC 


AAGGCCAACC 


AATATTATTG 


780 




TTTTATAACT 


TCaAACATGA 


TAAAGAAAGA 


ATACTTCaAA 


GGTTTAAGGa 


AGCAACCACA 


840 


25 


TTaGAGGATT 


CAAACTATAA 


AGAACGTTGG 


AATAGTGGAG 


ACATTAAGCT 


GCTTATAGCA 


900 




CATCCAGCAA 


GTGCAGGGCA 


TGGATTAAAC 


TTACAACAAG 


GTGGGCACAT 


TATTGTTTGG 


960 


30 


TTTGGACTTA 


CATGGTCATT 


GGAATTATAC 


CAACAAGCAA ATGCAAGATT 


ATATAGACAA 


1020 


GGACAAAATC 


ATACGACTAT 


TATTCATCAC 


ATTATGACCG 


ATAACACAAT 


AGATCAAAGA 


1080 




GTATATAAAG 


CTTTACAAAA 


TAAAGAACTA 


ACGCAAGAAG 


AATTGATGAA 


AGCTATTAAA 


1140 


35 


GCAAGAATAG 


CTAAGCATAA 


GTAATGGAGG 


TATAAGATGG 


GAAAGGCATC 


ATACGATATT 


1200 


AAGCCAGGTA 


CATTTAAATA 


TATTGAGTCA 


GAGATATATA 


ACCTACAAGA 


GAACAAGAAA 


1260 




GAGATAAATA 


GATTGAGAAT 


GGAGATACTT 


AACCCAACGA 


AAGAGCTAGA 


CACTAACATT 


1320 


40 


GTGTATGGAC 


CGTTGCAAAA 


AGGTGAACCA 


GTTAGAACAA 


CTGAACTAAT 


GGCAACAAGG 


1380 




TTATTGACTA 


ATAAGATGTT 


ACGAAACCTA 


GAAGAAATGG 


TCGAAGCAGT 


TGAAAGTGAA 


1440 




TACTTAAAGT 


TACCTGAAGA 


TCATAAGAAA 


GTAATTAGGC 


TAAAGTATTG 


GAATAGAGAT 


1500 


45 


AAGAAGCTAA 


AGATAGAGCA 


Aft L A\jVj AL» A i 


GCATGTCACA 


TGCATCGTAA 




1560 




ACTATACGAA 


AGAACTTTGT 


TAAAGCGGTA 


cGwtATCATG 


CAGGTATCAA 


ATAACATTGT 


1620 




GCAAAGATTG 


TGCAAAAGGC 


CTACAAATCT 


GTAGTAATAT 


GATAGTATCG 


GAAAGATGTA 


1680 


SO 


TAAAGTTATC 


TAAAAGTTAT 


ACGACACAAG 


TACACGAGGC 


ACATCGCTAT 


GCGtGTGTCT 


1740 




TTTGTTATGC 


AATCAAAGAG 


GTGTAAGAGA 


TGACCAAGCA TAATAACATT 


TATAAGCATG 


1800 
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TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 4 20 

TTATGATGAC AATGCTTTTC TTTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 4 80 

GCGTTATAAT GAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 54 0 

CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TTCTAATGTA TATGAAAGTG TATCAATATA 600 

AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 660 

CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 72 0 

GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 780 

,5 GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 812 

(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 
GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA TGATTGATAT AAGTTGATAA 60 

TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 120 

CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATTAGCATTA ACATTACCAA 180 

TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 24'0 

TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA TCTTTTATTA AATGACTTGT 300 

TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 360 

AAATTTATTA GATTTAAAAT AGAACACTTT GTCATAGATT TGAAATGACA AATCTCATTA 4 20 

40 TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 4 80 

ATAATCGrGc TCCTTACAAA TAaATTATAA AAttGGCGAA CTAAAT 526 
(2) INFORMATION FOR SEQ ID NO: 517: 



30 



3S 



45 



SO 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS r double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 
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CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 900 
GTAAATATCT GCAAATTG 918 
^ (2) INFORMATION FOR SEQ ID NO: 514: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



IS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 
CAATACTCGG TATTATAGAT AGTCCTACTA CATTTATATG GGTTTGTATC ATTACATTAA 60 
TTGCACATCA ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 120 
20 TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GGCTTTACAT 180 

TAGTTCTGAT TGCAGTGCCA TTATATGCTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 24 0 

AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 300 
GTCTTTTAAG AATATAAGAT ACTATCGCAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 3 60 

TTTTTTTGTG CATAAAAAAT CAGTCmAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 4 20 

TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 4 80 

30 

GATATTATTT AATTGCGTGG AATCCGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 54 0 

TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 587 
(2) INFORMATION FOR SEQ ID NO: 515: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 60 

TTAAACCAAA TAAAAATAAT GTTTTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 120 

TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 180 

SO 

TATTGTGTGC TCTCGCTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 24 0 

TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA TAAAwAcCAA 300 

55 
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5^ 



5 



10 



20 



AATATTAATA 


TCACTAACCC 


TATAATGAAA 


AGATATACTG 


GATAGGATAA 


TACGOTGGCA 


600 


GATTCAGTTA AACCACGTAA 


ATTTAATATT 


AAAATAAAAA 


GTACAAGTAw 


ACATGCAATC 


660 


AGTrCicTTAT 




ACTTGGGAAT 


GCAGCAACAA 


ATGCATCAGC 


ACCAGATGAT 


720 






TATGTAATCG 


ACTAATAATG 


AGCCTCCTGC 


AAGCAATCCC 


780 




^ 1 AAA i X TjGT 


CTTGGACACC 


ATATACGCGC 


CGCCACCTTT 


AGGATATGCA 


840 


TAAATAATTT 


GCCTATAAGA 


CATAATTAAA 


GCAGCTAATA 


AGATCAGAAC 


AGCACCTGCA 


900 


ATCGGTAAAG 


TATACCAAGT 


TGCAACTGCA 


CCCACTACTG 


ATAGTGTAAT 


CAGTATTTGT 


960 


TCGGGACCGT 


AAGCCACTGA 


AGATAATGCA 


TCCGACGAAA 


GAATCGCTAA 


CCCTTTAAAC 


1020 


TTCX3ATATCT 


TTTCGTCTTT 


TAGTTCTCTG 


in-i-i-ri-iTAG 


GTTGCCCTAT 


AATAAGtCTT 


1080 


TTAAATTGAT 


kGAcATAAtC 


TCCTATTCCT 


TTTTATAGTT 


TTnGATGGAA 


ATCAC 


1135 


(2) INFORMATION FOR SEQ ID NO: 513: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

30 



35 



40 



45 



SO 



TTATGGTAAA 


ACTTTAACAC 


AAGCATTACA 


TAAATCGCAA 


TCATTTATGT 


GGCAAAAACG 


60 


TTGAATTTAA 


GCCATTTCTA 


TTCATTTTAC 


GTAATTATTA 


GCCGTATATG 


TGTAATAATA 


120 


CACATTTTAT 


TCAGATrrrr 


TATCGCGCTC 


CATTAAATCT 


TTTACGCATT 


CTTTTACTGA 


180 


GATATTTTCA 


AATAATACTC 


TATATAATGC 


ATTTGTAATT 


GGCATATCCA 


CATnTriTC 


240 


TTTAGCTAAA 


TGATAAACTG 


ATTTAGTTGT 


ATAAATACCT 


TCAACAACCA 


TATTCATTTC 


300 


AGATAATGCT 


TGATCCATTG 


ATTCACCTTG 


TCCAAGTTTA 


TATCCTAATG 


TGAAATTCCG 


360 


AGAATGTGTT 


GATGTGCAAG 


TAACGATTAA 


GTCACCGATA 


CCACCTAAAC 


CTAGAAATGT 


420 


CATAGGATCG 


GCACCTAACT 


TTTCACCTAA 


TCTACTAATT 


TCCGCTAAGC 


CACGAGTCAT 


480 


TAATGCAGCT 


TTTGCATTAT 


CACCGTAGCC 


AATTCCAGCT 


ACGATACCAC 


TTGCTACTGC 


540 


GATGATATTT 


TTCAATGCAC 


CACCAAGTTC 


AACACCAATC 


AAGTCATCAT 


TCGTGTACAC 


600 


ACGCAAATAA 


TCATTCATAA 


ATAAATCTTG 


CGTTAATTTA 


CTTACACTTT 


TATCTTTTGA 


660 


TGAAGCAGCA 


ACTGTAGTTG 


GTTGCTTGAC 


TACAACTTCT 


TCCGCATGAC 


TTGGCCCTGA 


720 


CAACACGCCA 


ATACCTGCAT 


TATATTCAGG 


TGAAATAGAA 


TCTTCAATCA 


TTTCTGACAC 


780 
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AAGATATGCA TACCGTAATG ATAGATATAG TTTTAAACGT GACTTCAAGC TATATGAATG 186 0 

TGATGACTGT TCATCATGTT CTTTGAGACA TCAATGCATG AAGCCAAATT CG AAATCCAA 1920 

TAAGAAAATT ATGAAGAATT ATAATTGGGA ATACTTTAAA GyCCAAATTA ATCAAAAGCT 1980 

TTCTGAACCA GAAACGAAAA AAATCTATAG TCAAAGAAAA ATTGATGTAG AGCCTGTTTT 204 0 

TGGATTTATG AAGGCTATTT TGGGTTTCAC TCXSAATGTCA GTTCX3AGGAA TAAATAAAGT 2100 

TAAACGAGAG CTAGGTTTTG TATTAATGGC ACTTAATATA AGGAAAATAG CAGCTCAACG 216 0 

AGCTGTACAT TATAAAATAC ATATCAAAAA AGCTGATTTC TATCAAATAA TTAATAGAAA 2220 

TCAGCTTTTT TACATTGCCT AAGAATTTAA TGTCCCAAGC CCTTTTTATC GAATAACTTA 2280 

TTGTAAACCT TGTCTTTCTT GGTTATTGTT TTCGTTATTT TTTTCGTGTT TTTGTTTCCA 234 0 

CrCTTTTTGA GTCATTACAT CGTCAACTTG CATGTTAACT TCAACTACAT CTAAACCAGT 2400 

AATATATTTA ACTTGTTCTT TAACTAATTC AGTTACTTTA CGGAAGATTT TTGGTGCTGA 2460 

TTCACCATAT TCTAAGATTA CTTTTAAGTC TACAGCAGCT TGTTTTTCAC CAACTTCAAC 2520 

AGATACACCT TGAGTAACAT TGTTGCCACT TGAGAATGCA TTAGTGAATG TATCAGTTAA 2580 

GCCACCTTTC ATATCTAAGA TACCTTTAAC TTCACGTGCA GCGATACCAG CAATTTTTTC 264 0 

AACAACTTCA TCAGAGAATG TTAATTTGnT TTTGAATTGA GGCTCCTGAT TTnGTTCnTG 2700 
(2) INFORMATION FOR SEQ ID NO: 512: 



75 



25 



30 



35 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 

40 ATCAAATACA ATAATTAAAA TAaTGGCTAA TACaCCTAaA ATaATAATcG AGTTaGAGTA 60 

GCCTT^GCGA yCACCcTAAC aGTArACATT CTAGGCATAT ATTTaTCTTT AGACATTGAA 120 

GCCGCTAACA TCGGAAAAGC TGTAAATCCA GTGTTGGCTG CTAATACCAA AATCATCACG 180 

GTCGTTGCCT GTACAAAATA GAAGGCAGCA TTATCACCAA ATATTTGCAT AGCTAATTGT 24 0 

GATAAAACTG TCGTTTCCGT TTGTGGcAAA ATTCCATAAA CATATGCTAA ACCAACGATA 300 

CCAACTAATA AAAACGCTAA AATTGAACCC ATAGCAATTA ACGTTTTCAC AGCATTTTTA 360 

GCACTTGGCT CTCTAAAATT GGTTACCGCA TTTGAAATAG CTTCAACACC CGTTAACGAT 4 20 

GAAGCCCCTG ATGAAAATGC TTTTAATAsC AAGAATAATG TTACTCCAGG AACCGCAGTT 480 
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AATCTAATTT TTCCTCCTAA TCTCGGATTG GATAGTGGTC AATTTITCTC GCCCGCCATT 6C 

^ TTAGCGTTTG TTCTAACTGG GATTGGTTTA CCATTATTAG GTGTGATTGT AGGTGCACTT 12 C 

GATAAAGAAG GATATATTGG CGCATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 18C 

TTAATCATCA TTTATTrGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 24 C 

TCTTTTGAAA TGACAATTAC ACCAATTATA CATAGCAATA GTAGTATCGC rrrATTTATA 3 00 

TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 360 

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 4 20 

,5 AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 4 80 

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 54 0 

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 600 

ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660 

TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 720 

GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 780 

ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGTCTA 84 0 

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TCCATAGAAT CGTACCTAAA 900 

GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 960 

GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 1020 

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC CGACAAAACG CATTTCACAA 1080 

35 CAAATTCCAG TTATTATCGT ATTTATATTG TCGATTTTCA GTGTTATTAG TAAGTTAGGT 114 0 

TGGcTGAAAA TTAACTTTAT AGAATCATTG CCTCTAAGAG CGTATTCTTT AGAGTGGkTC 1200 

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTCG GCATATTTGT AAAACAAGAT 1260 

40 CCAATTAAAT ATC7U\CAGGA ATAACXSAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 1320 

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 1380 

ACAATACTTC GTATTGAATG GCTTCX5CTTT CCTAGGGTGC CGTCTC7VGCC TCGGTCTTCG 1440 

45 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 1500 

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 1560 

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 1620 

ATTAACGAGG TGCCTTATGT ATAAAAATTA TAACATGACC CAACTTACAC TACCAATAGA 1680 

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 174 0 



55 
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TTCGTTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATCGTTCTT CTCTTTTTTC 1080 

TAAATAaAGA AAACCTATCA CTGCTTCTAT CkCTGAACTT TTACGATATG TTTGAACATC 1140 

^ AGTCTTTTTA GCTTTAGTAT GACTTTTCGC GTTACGCCCT CGCTTCAAAA TATCCATTTC 1200 

TTCGTCTCTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCX3CCT GACTTTTGGC 1260 

AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 1320 

10 

GATATAGGTA CGTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 1373 

(2) INFORMATION FOR SEQ ID NO: 510: 

<i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 717 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 



25 



35 



40 



TCAAGTGCCA 


ACTACACAAC 


TGATTTACAT 


TCTTTAGGTC 


AATATGTACA 


AGAAGGCCGT 


60 


CGCTTCTTAT 


TCGAAACAGT 


TGTAAAAGTA 


AATCATCCTA 


AATATGATAT 


TACTATTGAA 


120 


AAAGATAGTG 


ATGATCTAGA 


CGGATTAAAT 


TATTTAGCTG 


GTAAAACAAT CGACGAAgTT 


180 


AACACAAAAG 


CATTCGAAGG 


TACATTATTA 


GCGCATACTG 


ATGGTGGTGT 


TCCTAACATG 


240 


GTAGTGAACA 


TTCCACAATT 


AGATGAAGAA 


ACTTTCGGTT 


ACGTCGTATA 


CTTCTTCGAA 


300 


CTTGCTTGTG 


CAATGAGTGG 


ATACCAATTA 


GGCGTAAATC 


CATTTAACCA 


ACCTGGTGTA 


360 


GAAGCATATA 


AACAAAACAT 


GTTCGCATTA 


tTAGGTAAAC 


CTGGTTTTGA 


AGaCTTGAAA 


420 


AAAGrATTAG 


rAGAACGTTT 


ATAAAATACA 


TTACTTCAAA 


GATTAGTGAA 


GTTTGAAAAG 


480 


ATAGAACTAG 


ACGTTAACTA 


TTTAAAGCAT 


ATTTTCGAGG 


TTGTCATTAC 


AAATGTAAAA 


540 


ATGTAATGAC 


AACCTCGTTT 


TTATTTATAT 


GCAAGAACTA 


GGTTACTAGC 


TAaTGTGACA 


600 


AGATGTTAAG 


AGAAAATTAA 


aGATAAAATA 


ATATCTGCCT 


TACAATAATA 


TTGTTATACT 


660 


ACTAGnGACT 


GATTTATTAG 


CATGATTACA 


TGTTAATGTT 


TCTTTACTTA 


GTAATTA 


717 



(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 
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GGCGGTAACA CGGGTTCGAG 


TCCCGTAGgA GTCATACAAG CAGAAGTGAA ATATCGCTTC 


4 80 


TGrrn"i"iTA ttacatattt 


« J. 1 o i 1 CjAGG AAGGTTGTCC 


GAGCTGGCCG AAGaGCACGC 


54 0 


CTGGaAAGTG TGTAGGCGTC 


1 1. r v-aagcjGTTCG 


AATCCCTTAC CTTCCGTAAA 


600 


GGCGCTTAAA TTGGTTTTAC 


CCATTTTAAG CGCTATTTTT 


ATTTTGGACT CAATCCCTTG 


660 


ATATATCTGC ATTTGAGCTA 


TTATCCTCAT TTTTACACTT 


CTTATTTATT TATATCCATT 


720 


TAAAATTTTT TAGCCACAAT 


GTGACTAATT TTTGaTGAAT 


AATCCTAATT TTAGtCACAA 


780 


GATTTTGAAG TTTAGTCACA 


AAACAAATCA TTCAGATTTT 


TTTCyATAAA TTTAGTTTCA 


840 


(2) INFORMATION FOR SEQ ID NO: 509: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2S 



30 



35 



40 



45 



50 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


509 : 






TTCTATGAAA 


CTATCAGCTG 


TTTCTTTTTC 


CTTGGTAAAA 


ATCGTTTTAA 


CGCCATGATA 


60 


CATGTATTCT 


CTTTCAaCAC 


CCGATTGGTC 


ATAAGCATCG 


AAAACACAAA 


TAATTTCATC 


120 


TGAAATAACT 


GCATTATAAT 


TTGCAATTGC 


ATCTATTAAT 


TGCATTCTAG 


CTTCTTCTAA 


180' 


ATTCTCTTTT 


GCAATGGCGC 


TTAGCGTTGG 


TGATTGTCCT 


ATCATATTGT 


ATCCATCAAT 


240 


GATTAAGTAA 


CGTTCTTTCA 


TTATATTTCT 


CCAACATCAT 


GTCTTTTTCG 


AAATACTTCG 


300 


TACATCATTA AACTTGCTGC 


AACCGAAGCA 


TTCAAACTGT 


TTACATGTCC 


AACCATTGGA 


360 


ATCTTAATAT AAAAATCGCA 


TTTATCACTT 


ACTAGGCGAC 


TCATACCCTG 


TCCTTCGCTA 


420 


CCAATTACAA 


TAGCCAATGA 


CATGTCCGCT 


TCTAGATTTC 


TATAATCTGT 


TGCATTATTA 


480 


GCTTCAGTGC 


CAGCTACCCA 


AAAGCCATTA 


TCTTTTAGTT 


CATCGATAGT 


TTTAGCTAAA 


540 


TTTGTCACTC 


GAATAACTGG 


TACATGTTCA 


ATTGCACCTG 


TTGAGGCTTT 


TGCAACTGTT 


600 


TGCGTTAGTG 


TAACTGAACG 


ACGTTTAGGA 


ATAATAACAC 


CATCAACTCC 


CGTTGCATCG 


660 


GCTGTTCTTA 


AAATTGATCC 


CAAGTTATGT 


GGGTCTTCTA 


AGCCGTCTAA 


TATAAGTACT 


720 


GTCAATAAAC 


CTTCTTTTTC 


TTTTTGCTGT 


TTTAAAAATT 


GATCGAAGTC 


AGCATATTCA 


780 


TATGGTGCAA 


TAAGCGCTGC 


AACACCCTGA 


TGTGGTGCAT 


TTGCTAAAAA ATCTAATTTA 


840 


GATTTTGGTA 


CAGTTTGAAC 


AATGATTTTT 


TGATCTTTTG 


CAl-rx-riTAA AATTTCATTA 


900 


ATTTGTTGCT 


TTTTAATACC 


TTCTTGAATC 


AATATCTTAT 


TTAyCGGATG 


CCCAGTAATA 


960 
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TTATCTTGGC AAGAAAGCAG AGACAAACCG CATTACTGGC ATTTCTAGTA AACCACCTAT 24 0 

TTTACTAACA CCATTATTTC CAACTTATTT TTTCCCAACA CATTCTGACA GACAAAATGA 300 

AAATATTTGG TTAAATATGC ATTATATCGA AAGTATTAAA GAATTAAAAA ATCXSTAAATG 360 

TAAAGTGACA TTTATTAATA ATGAATCAAT CATTCTTCAT GTTTCATACC ACAGTTTATG 420 

GCATCAATAT AACAATTCCA TTTTTTACTA TTACATGGTA GATAAACAAT CTCGCATGAT 4 80 

ATCAAAAAAT CCCGACCAAC CAATAGATTA TAATAAAGCC ACATTGAATG TGTTTGAAGC 54 0 

ATTGACACGC TATTCTTTAT TTGAAGATAA ATAAATTGTT TATTTTTAAA ATATGCGGAA 6 00 

TGTTTTATAA ATATAGTGTA AATGTTCTGC ATATTTTTTT AAGGTATCTA TTGCAAATTA 660 

ACTTAATCTT GTTATAATAA TATTTGTGCT TGaTATTCAA ACACATACAA ATTAATCCAC 720 

AGTAGCTCAG TGGTAGAGCT ATCGGCTGTT AACCGATCGG TCGTAGGTTC GAGTCCTACC 780 

20 TX3TGGAGCCA TTGGAAACGT ACTCAAGTTG GCTGAAGAGG CGCCCCTGCT AAGGtGTAGG 84 0 

TCGCGAAAgG CGCGAGGGTT CGAATCCCTC CGTTTCCGTT ACTTGCTAAA ATGGTATATA 900 

CCATTTTAnC TTTTTTGTTT ACTTATATAT AATGAATGAG AATTTCACTG TTCTTTTATA 960 

TCAATTTTAA AATTCTAAAA ACCTTTCCTA GATAATCTTC TCTAAGAAAG GTTTTTATAC 1020 

TTGTTGAACT TATAATTAAT TTATTACATA GCAATATTTA CCTGTTTTTA ACTATAAAAT 1080 . 

TATCACTACA TGAAATACGA TAATTCGGaT CTCTTAACTT CTCTGCaATT AATGtACTCA 114 0 

TTGgTTTCAT CGTATGATTC ATGTATAATA GCATTTkTTA AATAATTC 1188 

(2) INFORMATION FOR SEQ ID NO: 508: 

<i) SEQUENCE CHARACTERISTICS: 
^5 (A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 

CCCAACTTCG GTTATAAGAT CCCTCAAAGA TGATGAGGTT AATAGGTTCG AGGTGGAAGC 60 

ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT 120 

TTGCGAACAA AAtCACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 180 

GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 24 0 

CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAACGTTG CCAGGCATAA 300 

TATTAATCCA CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT 360 

55 
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TTCGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 54 60 

TGCTACAAGG ACTTTGCACT TAATCGAAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 5520 

5 

TATAGTTGCA ATGTACCAAA TTTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 55B0 

ATCGTTTGAA GCGTATTTTG ATAATATTTC ATCTGTATCT TATATTTATT TTTTAATTGT 564 0 

GTACCAATTT CTTCATCTGT CATCCCACGG CX3ACX3ATTAA ATGCATCGGT TTTATAGTCT 5700 

ACAAAATAAT GCACACCATC TTTAACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 5760 
ACG 

IS <2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



5763 



60 
120 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 
CCATATGATT TTGTGCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG 
CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC AGTGTATTTC 
TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 160 
GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 24 0 

GATTATCTAA TCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC 
CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CATCTCAGAA AGTGCAGTAG 
TTGTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 420 
AT 

40 (2) INFOR^4ATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1188 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



300 
360 



422 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 
GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 60 
ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 120 



55 
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GTTAGAATTA TGTTGGACTG GTACTACTAT CCAGTCCATT TTTTATGTTT AACATTTTTA 3660 

GAATCAAAAA AGACATAAGG TCTTGGACTA ATAATTGTCC ATGCCTTATG TCATATACTA 3720 

TATGTCTTAT CAATTAGCCA ATACCGAATA ATTTTGATAT AGGsCCTAAC GGTAGAATGA 3780 

CACCTAATAC CATTGTGATG ATAATTAATG CAATTGTTAT CCAAAACATT GTGTGACTTT 384 0 

GTTCATGTCT CTTTCTTTTA GCAATCGACA CTTCCATCAA TCCAACTACT GCAACACCAC 3900 

ACAGCATTTT CAATGTAAGC AACATATGAT TTGCCCCGCC ATTCATAAAT GACTGAATTA 3 960 

ATATCCAAAA TCCTGAAATT AACGTCAACA GCATAAATAA GCGTAAAATC ATGTGCAACG 4 020 

,5 GTTTGAAAAA TGGTGATCTG CCTTGATTTT TTGAAATGTT TAAGTATGTA GCGATAAATA 4 080 

AAATAATCGC TAATACCCAA CTTAATATAT GTAAATGTAA CATACTGATT CCCCCCACTT 414 0 

TAATTATTTA TATTATTAAA TTAAAGCTTC TTGGGATTAA TACCCACTTG CTTGTAATTT 4 200 

20 AATCATGATT TGATTATACA CGAATATATA TTCTACCACA CTTCTATATT TGAGAGGAAG 4 260 

AACATGACAT TTTATTCCTT ATTAGAATAT TGTGAATCTG CTGTAAAATA ATCAACTACT 432 0 

TCTTAATATC AATATTTCAC TTTCATCTCA AAATGGTAAC ATTATAAATA ATTTATCTTT 4 380 

AACACCTTTT TAGAAAAGCA AGAAAAAACT AACCAATCTA TATAAAGACT GGTTAGCTTT 444 0 

TTAAATGATA ATTATTTAGC GATATAAGTT GTCAGCGTTC CAATATTATC AATAGTCACT 4500 

TTAACTTCAT CACCTGGTTG TAAAAATTTA GGTGGTTGCA TACCTGCACC AACGCCTGCT 4 560 

GGTGTACCAG TTGCAATAAT ATCTCCCGGA TGTAGTGCAA CATATTTTGA AATTTCTTCT 4 62 0 

ATTAATTCAT CAATTTTAAG AATCATTTCG CCAGTGTTAC CATCTTGTCT AATTTCATTG 46 80 

TTAACTTTTG TAACAATATT TACATTTTCA GGTAATGGTA GTTCGTCTTT AGTAACGATA 4740 

TAAGGACCCA TTGGGCAACC GCCAGTTAAA CTTTTTGATA AAAATGCTTG ATCTTGTTCA 4800 

CTTTGTGCTT TGCGATCAGT GATATCGTTA ATAATTGTAT AGCCGTAAAC ATAATCTAAA 4860 

40 GCTAATGCTT TTGGAATCTT TTCACCAGAC TTACCAATAA CAATACCTAA TTCACCTTCA 4 920 

TAATCTAATT GATCAGTAAT ATCTTTATGA TTTGGAATTG TTGCATTATC TCCTGTTAAA 4 980 

GATGACGCTG CTTTTGTAAA TACATATAAT TTTTCCACTT CATGATTTAA TTCGTTCGCA 5040 

TGATCTTTGT AATTTCTACC AAAAGCAATC ACATTATTCX3 GAGGTGTTAC TGGTGGTAAA 5100 

AATTCAATGT CATTAAATGA AATTTTATAG TCTTCAGCTT TGCCGCTATC TTCTGCTGCT 5160 

ACAACTGCTT TACGTACTTG TTCTTGAAAA TCTAAAGTAT GATTTTGTTG TAAACCAGCT 5220 

AACAATGTTT TAGGATGGAA ATCTCCTTCT GCAAAGTCAG CAAATACTTG TGTTAAATCC 52 80 

CATACAGCAT CTTCGCGTTT TACTTTAACG CCATATGAAG TTTTGTCATT ATACTTGAAT 534 0 
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AAGAATCCCC 
ATATGCTTAC 
CTGTCATTGA 
AATTTTTAAA 
AAATGATTGA 
ATGTAGCGAA 
ATTTCTTTAA 
TTTAAAACAA 
GCGGTGCAAC 
TTGAAAAAGA 
TTGTTGAAGA 
AAATTACAGT 
CTGTATTAAA 
GCCCTGGTGC 
ATTTAGAAGA 
TGGTTGTAGG 
TACACCCTAC 
ATCAACCTAT 
TTAATGCTAT 
TGATTATTGA 
AACTTGATCG 
TTTATGCAAT 
TTCCTTTAGC 
ATGACACTAT 
CATTTGCGAG 
AAGTCACTCA 
TATATTATGA 
CAGATAAACG 
TAACTGAGTT 



TACCTCAATT 
TCATTTTTAT 
AATTGTAAAA 
TATTGACCGA 
GTACGCGCGT 
CAATATTACA 
TTTAAATATT 
TAGGAGGTAA 
ATGTGCCAGC 
TCGTGATATG 
TAGAAGATAT 
AAAAACTTAT 
TAGAAAGACA 
AAGTGCAAAT 
CACTGATGCT 
TGCAGGTTAT 
TTTAATTCAT 
ACTTGATGAA 
CAATGGAAAT 
AGGTGTCGGT 
AAAAGGTTTC 
AGGCGATATT 
TTGGGGCGCT 
TGAATTCAAA 
TGTCGGCGTT 
AGGTGCACAC 
CACTTCAAAC 
TATTGATGTA 
TGAAGTGGCT 



TTAATTGAAG CCGAAGAAAG TAAAATACCT 
GCCGATCATA TTGAGCATCG ACGCTGOGGC 
CTTGGTATTA ATAAAGCAAG AGGCATTGAG 
AATAATATTA TTGCATTCX3G TGATGAAGAT 
CACGGTGTTG CTATGGAAAA TGGTTTGCAA 
TTCAACAATA ATGAAGATGG CATTGGTCGA 
AGATATTACT GTTAATTTAT AACTAATCAT 
GTTACGATGC CCAAAATAGT CGTAGTCGGA 
CAAATTCGAC GTTTAGATAA AGAAAGTGAC 
AGCTTTGCTA ATTGTGCATT GCCTTATGTC 
GCTTTAGCGT ATACACCTGA AAAATTTTAT 
CATGAAGTTA TTGCAATCAA TGATGAAAGA 
AACGAACAAT TTGAAGAATC TTACGATAAA 
AGCCTTGGCT TTGAAAGTGA TATTACATTT 
ATCGATCAAT TCATCAAAGC AAATCAAGTT 
GTTTCATTAG AAGTTCTTGA AAATCTTTAT 
CGATCTGATA AGATAAATAA ATTAATGGAT 
TTAGATAAGC GGGAGATTCC ATACCGTTTA 
GAAATTACAT TTAAATCAGG AAAAGTTGAA 
ACTCACCCCA ATTCAAAATT TATCGAAAGT 
ATACCGGTAA ACGATAAATT TGAAACAAAT 
GCAACATCAC ATTATCGACA TGTCGATCTA 
CACCGTGCAG CAAGTATTGT TGCCGAACAA 
GGCTTCTTAG GCAACT^TAT TGTGAAGTTC 
AAACCAAACG AACTAAAGCA ATTTGACTAT 
GCGAATTATT ACCCAGGAAA TTCCCCTTTA 
CX3TCAGATTT TAAGAGCAGC TGCAGTAGGA 
CTATCGATGG CAATGATGAA CCAGCTAACT 
TATGCACCAC CATATAGCCA CCCTAAAGAT 



GAAATCAAAA 
GCACCATTCC 
CAAGTTAGAC 
AATGATATTG 
GAACTTAAAG 
TATTTGAATG 
TTTATAATAT 
GCAGTCGCTG 
ATTATTATTT 
ATTGGCGAaG 
GATAGAAAGC 
CAAACTGTAT 
CTCATTTTAA 
ACACTTAGAA 
GATAAAGTAT 
GAACGTGGTT 
GCCGACATGA 
AATGAGGAAA 
CATTACGATA 
TCAAATATCA 
GTTCCAAACA 
CCGGCTAGTG 
ATTGCTGGAA 
TTTGATTATA 
AAAATGGTAG 
CACTTAAGAG 
AAAGAAGGTG 
GTAGATGAGT 
TTAATCAATA 



1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

^ -X w V 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 
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ATAATTAACA TCTTTTGCTA TATACCACCA GTTTGATACA TAAAATATCG CAGCAATAAT 60 

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 120 

CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 180 

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 24 0 

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGCCACCT GtCAACCATT GCTTATTTAA 300 

GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATCGAGCC CTGGAATATA 360 

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 420 

TAAAACTGTT CTTCTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 480 

AATGCATATG TATATTATTG AATTACGACA TCATCAAAAT CATATTGACT AAAATACTGT 540 

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATGCAACTC 600 

20 GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCAATT 660 

TATTATAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720 

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 780 

AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 840 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 900 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 960 

AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020 

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080 

AATTTGAGGT CCCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 1140 

TTCATCATCA ACATTCACTT TCTATACTAA ACCCAAATTA ACGATATCAA TTCCTAATTC 1200 

AGGGTCAATT ACCATTTCTA ATGCACCTAA GATACTATCT TTCAATGCCT CTTCCATCCA 1260 

TATCACCTCT TTAATGTCAT ATTATTCATA ATATATCAAA TATCCGACAA AACGCCAATA 1320 

AAATGCTATG ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC 13 80 

ATTTAATATG TCTAGACTTA GACGGAACAT TATTAAACX3A TAACAAAGAA ATTTCATCAT 1440 

''^ ATACTAAACA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 1500 

GCAGACCTTA TCGTGCAAGT CAAATGTATT ATCATGAATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 

SO 

AAATTTTAGA TTTAGGCATC GCACAAAACA TTATTCAAGG ATTACAACAA TATCAAGTAT 1680 

CGAATATTAT AGCAGAAGTG AAAGATTATG TTTTCATTAA CAATCATGAT CCAAGATTAT 1740 

55 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 
AnCCTGACAA GATTCTTATA TTCATAGATA TGAGAGCTAA AAATGAAATC AACAAACAnA 
CAATAAAGTA AACGATAATA GCCCATATAC CATTTTGTAA CCATATTACA AATTGTGTTG 
TATTATAGCC ATTTCCAGCT AATAATIGCT GGATAAATCC ATTATTGTTT AATCTATTrr 
CAAGATTAGC AAtCGATGTG TTATTACTGA ATGAAACAAG TGCTATAAAC ATCGTAATGA 
CAGTAAGCAC TAATAACATC ACCCAACATA ACCAACCTAG AACTTTTTCA GrTAATCTAC 
TTACTGGACG TTTAATTTGA GTAAATTGTT CTCCAGTCAT TCGTTACAAC TCCTTATAGT 
ACTTATCCCG TTATTATAAC TAAATATACA GTAAATAACT ACTATTTATC ATT-TTATTTT 
AATGACATTT TGAAATTCAA AAAGTTTTCA TTGTATTCAC TTAAAACTTC AGGTCCTAAA 
TCTTTATAAA CTTCAAGGCG TTCTTGCTCT TTCTTAGTCG GATAAAAACG ATGGTCGTCT 
TTAATCTCTT TAGGCAACAA TTGTCGAGCA GCCTTGTTTG GCGTTCCATA GCCTACGAAT 
TCTGTATTTT GCTTGTTATT TTTAGCATCT AATAAAAAAT TCATAAATTT ATATGCACCC 
TCTTTATTTT GTGCCGTTTT TGGAATTACC ATATTGTCGA ACCATAAATT CGATCCTTCT 
TTAGGAATAA CATAATTATA TTTATCCCCT TCrTGCACTA GAGGTCCTGC AACACCACTC 
CAAACAACCG CTATGTTACC TTCATTTTCT TGAAGCATCA TCGTAATTTC ATCACCTACG 
ACACCTCTTA CTTGTGGTGC TAGTITGGTT AAATCTCGCT CTGCTTCTTT TAAATGGTGC 
GAATTACGGT CATTAAGATT ATACCCAAGT TTATTCAAAC TCATGCCTAT AATCTCTCTA 
GCACCGTCAA CTAGTAAAAT TTGGTTTTTA AATTTAGGAT TATACAATGA CTTCCAACTA 
TCAAATGATT CATTTGGATA CTTTTCTTTA TTATATAAAA TACCTACAGT TCCAAAGAAA 
TAAGGTAAAG AATATTTATT GCCTCTATCA AATCACATAT TCATATAATC •TCAATCTAAA 
TTTTTAATAT TAGGTACCTT ATTATGATCT ATTGGTAACA ATAAATGATC TCTTTTCAAT 
TTTTGAACTG nATATTCACT AGGAnAAGCA ACATCATAAT GTGTACCGCC AGTGCGAATT 
TTGGnGTCCA nCGCTTCATT TGAA 
(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



ss 



NSCXX^ID: <eP_0786S19A2_l.> 



1430 



EP0 786 519 A2 



TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATTTGAAT 174 0 



GAACG 

^ (2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 103 5 base pairs 
(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1745 



75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 

TCGTCTTTAA TCTTGCTGAC TTTAGAAGGC TTACGAGTGC GGTTACCATT TTTTGCATCT 60 

TTAACTGATT GAACTAAAGC TtGACGTGTA GATTTATCAG CTAAACTAAT TGCACCACCA 120 

20 ATTACGGCAC CAATTAAAAT ACCAGGAACA AATTTATTTT CCATAAAAAA CTACCCCTCT 180 

TTCAAATTTG CATCTTTTAC GATGTAGTCT ATTAAATTAT CACAAGATGA TAATACCATG 240 

TCGTATACAC CTTCAAAATT ATTCGTGTAG TATGGATCTG GTACATCACT CTCTTCCATA 3 00 

TTACTAAATT CTAACAGTTT GAACAATTGT CCCTTAAGAT TAGGATTGAT AGATTTAATA 360 

TTATCAACGT TACTTTGATC CATAGCCACA ATGTAATCAA AATCATCTGT CGCTTCGAAT 420 

AATTCACTAA TCATGCCATC AAATGGAATA TTGTGTTTGT TGAGAATTTT TTGTGTACCT 4 80 

TCATGAGGTG GCTCTCCTAA ATTCCAGCTA CCAGTACCTC TTGAATGTAC TTTTIATATCA 54 0 

TGAATATTTC TGTCTTTAAG TCTTTGTCGC ATGATTGCTT CTGCCATTGG AGAACGACAT 600 

ATATTGCCAA GACAGACAAA TGCTACATCT ACCATTTTGA TTCCTCCAAA CTATGTAGTT 660 

ATATCCCCAT TTTATAGCGA CTTTAAACAA TAAGAAAGCA GATTATATAA AATTCTATTA 720 

AAGTTTATTA AATTGTGATA CTTTGATAAC ATAACTATTA TTAGAGGTGA ACATTGTGGC 780 

TATGACAAAT GAAGAGAAAG TnTTAGCTAT TAGAGAGAAG TT/^TATtG TTAATCAAGG 84 0 

ATTATTAGAT CCTGAAAAAT ATAAAAATGC AAATGaAGAA GAATTAACAG ATATATATGA 900 
TTTTGkTCaA yCAAGAGaAA GATTGTCGCC AAGTGaAGTG mCAGCTATTG CTGaCGCTTT 960 

AGGACAATTG CGACACGAAT AGGAGTGGGA ATTTTGACTA ATTACAAAGA AAAGTTACAA 1020 
CAATACGCTG AACTA 

(2) INFORMATION FOR SEQ ID NO: 504: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 
CTGTACATAC AGCAATATCG TTAACAAATC AAAACAGTAT TTTAGGATTG TAAACCATCA 
TAAACCTAAA ATACTGTTAT TTTTATTACT TAAATTTCTT CTTCAATGCC ITTTCAACAT 
AAGGTGGAAC GAATTCAGAA ATATCTGCTC GATAAGCTCC AACTTCTTTA ACAATACTTG 
AACTTATAAA TGAATAATTA GTACTAGACA TCATATATAA CGTTTCAATT TCATTCTTCA 
ACTTTTTATT CATTGAAGTT AAGCGTAATT CATATTCAAA ATCACTGACT GCTCTTAAAC 
CACGTATGAT TGTTTTAGCT CCTACTTGTT CACAATAATC GACTAGTAAA CCACTAAATT 
GATGAACCTT GACATTAGGT AAATGTTTAA CAGATTCTTC AATTAAATCC ATACGCTCTT 
CTAAACTAAA CGTACCTTCT TTTTTACTAT TTTTAAGAAC ACAGACATGA ATTTCATCAA 
ATCTATCTGT ACTTCTCTCA ATAATCTCTA AATGACCATA AGTAATCGGG TCAAAACTAC 
CCGGAATGAC CGCTATTGTA TGTTCCATGC TATTCTCCCT TTTCTAATAA CAATGTCTCT 
GTCAACCCAT AATGGTAACG TTTAATCATA TTAAACGGTT GATAATCtAT TTCTTCATGA 
TTGCTAAATT CACAAACGAT GATACCATTT TCTTTCAATA AAITAAACTC TGAAAITAGT 
TTTAAAGCTT TATCAATGAG ACCTTTATTA TAAGGTGGAT CTAaGAAAAT GACATCAAAT 



45 



900 
960 



- — w*«*^^v, w^AArtAixii AlAAACTTCA 84 0 

GATTGTGCCT CTAAATCCAA ATTCGCAAGA TTTGaTTTAA TAACTTTTAC AGCTTTAAAA 
TTT-TGaTCAA CAAAGATTAC CTTATCCATA CCTCGAGAGA GTGcTTCTAT TCCAAGCGCC 
CCGCTTCCTG CAAATAAATC TAAACCTATA CCTGACACAT CATATAAACT ATTAAAGATA 1020 
CCTTCTTTAA CTT^ATCCAT AGITGGTCTC GTATTACGGC CTTCCATACT TTCTAAAGCT 1080 
TTACTTTTAT GTTTACCTGC AATGACGCGC ATGITGTTCA CACTTCCAAT TCATTTAGTT 
^0 ATTTAATATA ATTTATTGAG AAAAAGGAGA ATGATAAACC AATGAAACAA ACATTTATTA 

CAerTGGTGA AGGTCTAACA GATTTGTTCG AATTCATGAC GATGATTGAA TATAACCATC 
AACGTATTGA TAAAATTATC TATTTTCATT CACCACAAGC TGAAAATAAA AAGTCATCTG 
TAGCAATCAT TATGAACCCT ACAACTGGCA ATCATTTCCA AGCATTTTAT ATCATGATAA 
ACGCTATTAA ATATCCATAT CCAGATTCAA ATAAAAAGTT TCAAATGATA AATGATTGTG 
CTGAAAAATT CGACATACCA ATTTTAGGTA TCGATGTACA GCCCCCTCAA GCATTTCATG 1500 
ATTTATCGTT ATATTATAAT TATTTAATTA GTGTGTTAAG GCTCCAAAAA TGGATACCAG 1560 
aACTTCAATA ATATTAATTA TATATTTCGT GTTTCTCTrr ITCGTAAGrr TTCTTTAAGT 



1140 
1200 
1260 
1320 
1380 
1440 



1620 



ss 



1428 
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10 



IS 



20 



25 



30 



35 



40 



45 



SO 



TGAGAGAAAA GTTTAGACAC TATTGTGAAC ATGAAGTTCT AAGTTCAATC ATCAATGGTT 240 

CATACATTAT CGTCAAAACC TCACCTGGTT TCGCCCAAGG CATAAACTAT TTTATCX5ATC 3 00 

AGCTAAATAT AGAAGAGATA TTAGGTACGG TGAGTGGAAA TGACACTACA TTAATCTTAA 360 

CTGCCTCAAA TGATATGGCA GAATACGTAT ATGCAAAATT ATTTAAATAG ACATGTATCA 420 

AATGAATAAT AAAAATTTGT TTCGTATCAC GTGTACTCAA GTTAGTTACC AAATATTAAC 4 80 

TTGTGTACGC GTTTTTTTAT GGAAAGAAAG AATTCATAGT CATTCAATTG ACTGTATAAA 54 0 

AAACTTTATA CAACATGTTT TTATGGGTAT TTTTGAATAA AAAATGTATA TTTTGACCCA 600 

AAATACCTTT ATTTATGTAT AAAAATCCAT TATTATGTAT TGTATAACAA AAAGATATGA 660 

AATTTTCGAC TTTCTTTATG TGAATATAAT CACATGTAAG CGTTTGAAGA TTGTCTATAC 720 

TCTAAATGAA TTCAAAGATA AAAGGAGGAA ATAGACATGA CAGATGGTCC AATTAAAGTA 780 

AATAGCGAAA TTGGAGCrTT AAAAACTGTG TTACTTAAGC GTCCTGGaAA AGAATTAGAA 84 0 

AATTTAGTAC CTGATTATTT AGATGGATTA CTATTTGATG ATATTCCATA TTTAGAAGTA 900 

GCTCAAAAAG AGCATGACCA TTTTGCGCAG GTGCTAAGAG AAGAGGGTGT TGAAGTACTT 960 

TACCTTGAGA AGTTAGCAGC TGTVAAGTATT GAAAATCCTC AAGTAAGAAG TGAATTTATT 1020 

GATGATGTAT TAGCAGAGTC TAAAAAAACA ATATTAGGTC ATGAAGAAGA AATTAAGGCA 1080 

TTATTTGCGA CACTTTCTAA TCAAGAACTT GTAGATAAAA TAATGTCAGG GGTACGT7VAG 1140 

GAAGAAATTA ATCCGAAATG TACACATCTA GTAGAGTATA TGGATGATAA GTATCCATTC 1200 

TATTTAGATC CAATGCCAAA CCTTTATTTT ACTAGAGATC CACAAGCCTC AATAGGACAC 1260 

GGTATAACAA TCAATCGGAT GTTCTGGAGA GCACGACGAC GAGAATCAAT ATTTATTCAA 1320 

TATATTGTAA AGCATCATCC TAGATTTAAA GATGCGAATA TTCCAATCTG GTTAGATCGA 13 80 

GATTGCCCAT TCAATATTGA AGGCGGCGAT GAACTTGTTT TATCTAAAGA TGTCTTGGCT 144 0 

ATAGGCX5TTT CAGAACGTAC ATCTGCACAA GCTATTGAAA AGTTAGCGCG ACGTATTTTT 1500 

GAAAATCCGC AGGCGACGTT TAAAAAAGTA GTAGCAATTG AAATTCCAAC TAGTCGAACT 1560 

TTTATGCACT TAGATACAGT ATTTACAATG ATAGATTATG ACAAATTTAC AATGCATTCA 1620 

GCCATTTTAA AGGCAGAAGG CaATATGAAT ATATTTATTA TTGAATATGA TGACGTAAAT 16 80 

AAAGATATTG CCATCAAACA ATCTAGTCnT TTAAAAGATA CTTTAGAAGA CGTACTAGG 1739 
(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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10 



TAGAAGTGGT GAAAATTTTA GTCX3CAAAAC ATTAAGCCAT TTGTATTATA AGTTAGTTAA 
TTGCTTTGTA GAAGAAGTAC AATTTGATGA 1XJGTX.TTGGT GATTTTAGAC TTTTAAGCCA 
AAGAGCTGTT AAATCCATTO CATCACTTCA AGAATATAAT CGnTTITCAA AAnGGnTATT 

TGA 

(2) INFORMATION FOR SEQ ID NO: 500: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
15 TOPOLOGY: linear 



1500 
1560 
1620 
1623 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


500 : 






20 


AAAGTnGGTG 


AAnCTATATA 


CTTAATCTAT 


ATTTATATAT 


TAACCATTAG 


GGTTAAAAAT 


60 




TACTCTAGCA 


TTTATGAATA 


GATGGGAGTT 


TATTTTATTA 


TTATATAGGA 


GAGATGTTGA 


120 


25 


ATGACACATC 


GCGCACTATT 


AGTTGTTGAC 


TATTCATATG 


ACTTTATCGC 


AGACGACGGC 


180 


TTACTAACAT 


GCGGTAAACC 


TGGACAAAAT ATTGAAGATT 


TTATTGTTTC 


TCGTATCAAT 


240 




GACTTTAATT 


ATTATCAAGA 


CCATATATTC 


TTTTTrrTGG 


ATTTACATTA 


TTTACATGAC 


300 


30 


ATTCATCATC 


CTGAAAGTAA ATTATTCCCA 


CCACACAATA 


TCGTAGATAC 


AAGTGGTAGA 


360 




GAATTATACG 


GTAAAGTAGG 


TAAATTATAC 


GAAACAATTA 


AAGCGCAACC 


TAATGTACAT 


420 




TTCATTGATA 


AAACGCGCTA 


TGATTCGTTC 


TTTGGTACCC 


CGCTTGATAG 


TTTATTGAGa 


480 


35 


GAAAGAAGTA 


TTAATCAAGT 


CGAAATCGTT 


GGTGTATGTA 


CCGATATTTG 


CGTGTTACAT 


54 0 




ACAGCAATTT 


CTGCATACAA 


CTTAGGtTAT 


AAAATTTCAG 


TACCTGCTGA 


GGGAGTGGCT 


600 




CATTT 












605 



40 (2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 9 base pairs 

(B) TYPE: nucleic acid 
45 <^> STRANDEDNESS: double 

<D) TOPOLOGY: linear 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 

TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 60 

GTGTAAGATA TAGTGCAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 12 0 

55 
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15 



20 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 

ATTGAAAGCG ATAATTCGTA nTAATTGAGT TTGTTGAAAA ATTTAGGGTA ATGTAAAGAT 6 0 

ATAAAAGATA CATAGAyTGG AGAGATATAA AGATGTTGAA TGAGATACAA ATATTAaATA 120 

aTGGATACCC GATGCCTTCA GTTGGGTTAG GTGTTTATAA AATCTCTGAC GAAGATATGA 180 

CTAAAGTTGT AAATGCTGCA ATTGACGCAG GCTATAGAGC GTTTGATACA GCATACTTTT 240 

ATGATAATGA GGCTTCACTA GGACGAGCAT TAAAGGATAA TGGCGTCGAT AGAGAAGATT 300 

TGTTTATAAC AACGAAGTTA TGGAATGACT ATCAAGGTTA TGAGAAAACA TTCGAATATT 360 

TCAACAAATC GATTGAAAAT TTACAAACTG ATTATCTTGA TTTATTTCTA ATACATTGGC 420 

25 CTTGTGAAGC AGATGGTCTA TTTTTAGAAA CATATAAAGC TATGGAAGAA CTTTACGAGC 4 80 

AAGGTAAGGT AAAAGCAATA GGTGTATGTA ATTTTAATGT TCATCATCTA GAAAAATTAA 54 0 

TGGCTCAATC AAGTATCAAA CCAATGGTGA ATCAAATTGA GGTACATCCA TATTTTAACC 600 

AACAAGAATT ACAAGAATTT TGTGATCGTC ACGATATTAA AGTGACTGCA TGGATGCCTT 660 

TGATGAGAAA TAGAGGACTA CTAGACGACC CTGTCATTGT TAAAATTGCT GAAAAATATC 72 0 

ATAAAACACC AGCACAAGTT GTATTACGTT GGCATTTAGC ACACAATAGA ATTATTATTC 780 

CAAAATCTCA GACACCTAAA CGCATTCAAG AAAATATAGA TATTTTAGAT TTTAATTTAG 840 

AATTAACAGA AGTAGCTGAA ATTGATGCTT TAAATAGAAA TGCAAGACAA GGTAAAAATC 900 

CAGATGATGT GAAAATTGGG GATTTAAAAT AACTGGATGT TAAATTTTAC GTTTATGAAT 960 

GCCTTTTAAT GTGTACATTA AAATAAATGA GTTGGTTTTT ACTATTTGAT AAAACAATAC 1020 

TCAGGTACAT TCAAAATCTT TTAAATAAAA AGGATGGACA TAGATGAAAA TTAGAGTCGT 1080 

45 CATTCCTTGT TTTAATGAAG GGGAAGTCAT TACACAAACA CATCAACAAT TAACTGAAAT 1140 

ACTTTCACAA GATAGTAGTG TGAAAGGCTA TGATTATAAT ATGCTTTTCA TAGATGATGG 1200 

TAGTACGGAT ACCACTaTAG ATGAAATGCA ACATCTrGCC ACAATAGATA GGCATGTCAG 1260 

CTTTATTTCT TTTAGTAGAA ATTTTGGAAA AGAAGCAGCT ATGATTGCAG GTTACCAGCA 1320 

TAGTACTGAA TTTGATGCAG TCATCATGAT AGATTGTGAT TTGCAACATC CACCTGAATA 1380 



55 
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(A) LENGTH: 1504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 




10 


AGCrCACGTC ATCTTCGGCG GCGCTAAATT AAAATAATCA ATTTCTCAGT TAAACTTTTA 


60 




TTTACAACAT ACTATTACTA TACATTACAA AITTTTAAAA TATACATTAC ACTCATTArr 


120 




CAATGGmAAG CGTATGATTT CmCAGCCCCC CTAGCTTGTA GAAATCATAC TTTCCTlM-rr 


IBO 


IS 


TCAATATATA TACAACTATT AAATCCCATA AGATTGCAGA GCACATAAGT AAATTTl-riT 


240 




AGAGCTTGAG GTTTGTTTAG CTTAAGCAAC CCATCAGCTC AAACACTTCC TCTTCACATa 


300 


20 


ACACTACAAA TCGCATTATG TTGCTTAATC TTATGTrTAT ATAAATTACA CACAATAAAT 


360 


AGAAAGAATG TGAACATCAT nanTnnm-rTv rr-rr.^*^^ 

* X XA\«AX X xnX \_M.X X^VjXvjTG 


420 




GGTTCAGGTA TTGTTATGTT AATGCAAGGC TACGAAAAAT TAACGGGCGG ATTTArrf^ 


480 


25 


AAAGGTTTAG TACCAGTCAT CGCTAACAAT ACTGATTCAC CAGAGTGGTA TAAGTGr^ 


540 




TTCGCAAATA TAGTTGCACA TACGACGTCA TTATTTGATA TTGTTGTCCC ACTrfirAr-ar^ 

j-Avjxxvjxv-v_v_ v- X vsvjACjAG 


60 0 




ATTGCAATTG GATTAGGTTT AATTTTTGGA GITTTTGCAT ATGCTGCTAG TTTr-r^^^ 


660 


30 


GCCTTTGTTA TGATAAATTA TATCTTAGCA GATATGATAT TTACGTATCC TCTTCAA-i-ra 


72 0 




ACTTTCTTTA TCCrri"i'ACT AATGAGTCAC TCArTGTTAA AACAGATTTC ACTTAAArAa 


780 




ATCATTAATT ACTTTAGAGG TCGTAAGAAC AGAGGTGAAA AAATAGATGA CCCACTTACT 


840 


35 


GATCGTGGAT GATGAACAAG ACATTGTAGA CATTTGTCAA ACCTATTTTG AATATGAAGT 


900 




TTACAAAGTA ACAACGACAA CTAGCGGTAA AGAAGCAATT TCTITACTAT CAAATGATAT 


960 


40 


TGATATCATG GTACTTGATA TCATGATGCC AGAAGTTAAT GGTTACGACA TTGTCAAArA 


1020 


AAtGAAAAGG CAAAAATTAG ATATCCCCTT TATCTATTTA ACTCCCAAAA CACAAGAACA 


1080 




TGATACCATT TACGCCTTAA CTTTAGGTGC AGATGACTAT GTCAAAAAAC CATTTAGTCf' 


1140 


45 


AAGGGAACTC GTTTTACGTA TTAATAATTT ACTTACAAGA ATGAAGAAAT ACCATCATCA 


12 00 




ACCAGTTGAA CAACTGTCGT TTGATGAATT AACACrTATT AACTTAAGTA AAGTtGTT^r 


1260 




tGTAAaTGGT CACGAaGTCC CTATGCGTAT TAAGGAATTT GAGTTATTGT f:rTA'n-r*o^ 

* ' »"w>rf<-wv XXX X X M. X X u 1 VjCj X A X X 1 AGO 


1320 


SO 


rrCTAGAGAA AATGAAGTTA TTTCTAAATC AGAATTACTT GAAAAAGTTT GGGGATATCA 


1380 




CTATTACGAA GATGCTAATA CCGTGAATGT CCATATACAC CGTATTAGAG AnAAATTAGA 


1440 




AAAAGAGAGC TTTACAACAT ATACCATCAC AACTGTATGG GGATTAGGAT ATAAATnTCA 


1500 



55 
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ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT GTTCCACTAA 780 
ATGAAATGCC ACAC ''94 
5 (2) INFORMATION FOR SEQ ID NO: 4 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 97: 



20 



25 



30 



35 



AGCCAGTTTT 


GcATTTCGTC 


AAATCGCAAT 


GAATATATTG 


ATTGCTTGTC 


AAAATTTGGA 


60 


AGAAATTTAT 


TCTTCTTTTT 


CTi-lTCAGCA 


GTTATCATTC 


CTAATTGTCC 


TTTCTTTTTA 


120 


TCTTAGTGAT 


AAAGAATCCA 


TCTGAATTAA 


AGTCTTGCGG 


CATGATTTGT 


AACGTTTTGA 


180 


CCAACTCTCC 


AGTTATCGGA 


TGTTGAAACG 


GTTCAAATTC 


GAAGTTTTTA 


TTATTTTTCA 


240 


AAAACGTATA 


AATCACGTTT 


TCATTTTCTA 


GTTGCTCAAT 


TGTACATGTT 


GAATAGATGA 


300 


TTTCTCCACC 


TATTTTTACA 


TTGTTTTTTA 


CATTTTCCAA 


TATTTCAAGC 


TGTAATTCAA 


360 


CTAGTGACTC 


AATATGTTGT 


TTGCTTTGAG 


TATACTTAAT 


CTCCGGCTTA 


TGTCTCATTA 


420 


CACCTAATCC 


GCTACATGGT 


GCATCAACAA 


GTATCTTATC 


GTATGTTTTA 


TCATAAGGTT 


480 


TTGTCGCATC 


ATGTTGAAAA 


GCTTTAATAT 


TTGTTAATCG 


TAATTnTTTT 


ATATTAAAAT 


540 


TAATTAAGTC 


TATTTTGTGA 


TCATGTATAT 


CTGAAGCGTC 


AACTTGCCCT 


TCTGGCATTA 


600 


AAACTTCAGC 


AATGTGACAA 


GCTTTACCGC 


CAGGTGCACT 


ACATGCATCT 


AATACGTGAT 


660 


CATGTCGGTC 


TACATTCATA 


ATGTGTGCAa 


CAAACATTGA 


GCrriTATCT 


TGAATTGAAA 


720 


CGAATCCATC 


TTTAAATGAA 


CGAGAATGAA 


TAATTGGTTG 


TCCTCCTATA 


TGGAGACAAT 


780 


AAGGTAAGTC 


ATGATCTTTT 


TCAACGTCAT 


AACCTTCGTC 


TTGCAACTTT 


TCAATAATAT 


840 


CATCTAATGA 


TGCTCGCGTC 


AGGTTGGCAC 


GCACAGTTGT 


TGATGTCGTT 


TCTAAAAATG 


900 


ACTGTAAAAT 


TTTTTCAGTT 


TCTTCGAGAC 


CATAATGTGT 


TGCCCAATGA 


TCTATAATCC 


960 


ACTTCGGCAT 


ACTATACTCG 


ATTGCCATTC 


rrrrriTAGG 


ATCTGCAATT 


TCATTAAAAT 


1020 


CAGGTAAGTC 


ACTACGCATC 


ATTGTACGTA 


AAATACCATT 


TACGACATTA 


CCATTATGAT 


1080 


AGCCACCGCG 


TTCTTTTGCT 


ATTTCAACTG 


CTTCATTAAT 


AATGGCATGA 


TTTGGAACTT 


1X40 


TATCTAnATA 


nACATATTGA 


T 








1161 
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TACACCTTTA TATTTAGCAG 


gtgctactga atcccaagtt 


GATTGTAAGA 


1140 


ATTGATA PTT 


ACCAGCTGCA CCTGgATGTT GGrTTTACAG CATGAATATT 


GCCACCTGaT 


1200 


TCACG*nY?T^ 


v-AA i i rGTTT TAGATGAGCa 


TTCACATTTA CTGATX5AACC 


TTCTGATGAT 


1260 


A X X VVX X ^_nVj 


A xkjr4ji\,x iCaC AGTAACTTGT 


GAATTGTTTG ATGTTGATGC 


TTGTGGTTGT 


1320 


A XXX w/Wj 


^--Ai 1 i 1\jTGG tgcttcaact 


TCTTGTGATT GTACTTGATT 


AGCTTGAACA 


1380 


GCTGATGGTG 


f*^*-"!. 1 AG 1 1 \3 CAGG T 


GCTTGTGCAC TCATGTCTGC 


TCCATTAGTA 


1440 


a a X X 


CtGTAATTCCA agcaaagtgt 


GTACCATCTG ATTCAAAGTG 


ATAAGTAAAC 


1500 


CCTTCATAGT 


caaatgtata attataagcc 


CCAGCTTCAA TTGGTTTTTG 


ATTTAATGTT 


1560 


i- vi*\l V..:AX X 1 V3 


attgcgccat ttgcctgaaa 


GATGCTTTAT TTAAGTCCGC TTCACnTGCA 


1620 


TGGGCTTCGT 


ggacctgcat ttcctggcta 


CGATTCCTAA ACCTACTGGC 


nAAnGATGAT 


1680 


GCGAGTAATG 


TTTTCTTCAT AATCTTAAAA 


TCCTCCTACA AGTGAATTTG 


TGTCTCTAAA 


1740 


AGTTTTACAG 


tggacgactg T 






1761 


(2) INFORMATION FOR SEQ ID NO: 4 96: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

<D) TOPOLOGY: linear 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 





TCATTTATGA AAAATGTCGG 


AnAGAnCCaa 


GaAAmaCAAT TGAgCGTGAA 


GAAAAAGCAA 


60 


35 


GACTTAAAGA AGAACAAAAG 


GCACGTCAAA 


ATGAACAGCC ACAAATAAAA 


GATGTGAGTG 


120 




ATTTTACGGA AGTGCCTCAA 


GAAAGAGATA 


TTCCAATTTA TGGGCATACT 


GAAAATGAAA 


180 


40 


GTAAAAGCCA GAGTCAACCA 


AGTCGAAAAA 


AACGAGTGTT TGATGCAGAG 


AATAGTTCGA 


240 


ATAACATCGT AAATCATCAT 


CAAGCAGATC 


AGCAAGAACA ATTAACAGAA 


CAAACTCATA 


300 




ACAGTGTTGA AAGTGAAAAC 


ACTATTGAAG 


AAGCTGGTGA AGTTACGAAT 


GTATCGTATG 


360 


45 


TTGTTCCACC GTTAACTTTA 


CTTAATCAAC 


CTGCAAAACA AAAAGCAACA 


TCTAAAGCTG 


420 




AAGTGCAACG TAAAGGACAA 


GTACTAGAGA 


ATACATTAAA AGATTTTGGG 


GTAAATGCAA 


480 




AAGTGACACA AATTAAAATT 


GGTCCTGCAG 


TAACTCAATA TGAAATTCAA 


CCAGCTCAAG 


54 0 


SO 


GGGTTAAAGT GAGTAAAATT 


GTAAACTTGC 


ATAATGATAT TGCATTAGCT 


TTAGCAGCAA 


600 




AAGATGTTAG AATCGAAGCG 


CCAATACCTG 


GTCGTTCTGC AGTAGGTATT 


GAAGTGCCAA 


660 
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ATTAGATGAG AGGAGTGTGA GGGTTGTCTG CCGAAAGACT ACTCGGCAGT CTAAAATCAT 1020 

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 1080 

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 1140 

TAAATGAaAC ATGCATGAAT TTATTTTTTA ATACAAGAAA CGTAACTACC AAAGGAGTTT 1200 

ACAATATGAA GAAAAGTAAA CGATTAGAAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 12 60 

TTTATAAAAA AGAACAAATc ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311 

(2) IKFORMATION FOR SEQ ID NO: 495; 

(i) SEQUENCE CHARACTERISTICS: 
^5 (A) LENGTH: 1761 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 95: 

TGCACTTTCT AAAAATACTT GCTTTACTTG TTCCAATTCC TTGTCAGACA TTGATTCGGC 60 

TAATTTATTC ATTATTGCCT CAAATACGTT AATTATGTCA TCCATTTCAG TACTATAAAA 120 

ATCAAACCAT TTTGCCGTAT CTTTTTCTCT ATTAAGCTTA TGATCACTTT GAGAGCGCTT 180 

AGCTAATTCT GCATAAATAT ATGGACAAGG TGCCATTGCA GCAATTGTAT AAATAGCATT 240 

TTCACGACTA TGCGCTTGGA AATACATATG TTTTATGTAA TGGTCGCCAC TTGGAGGCCA 300 

AACTTTTGTT TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG 3 60 

CGCAAGTACT TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT 420 

35 CATGCTATTC ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC 480 

AGCATCAGCT TTTAAGTAAT GGCGTAATGC GTCAGcTCCT ATATCTCCGG ATAACATCTT 54 0 

CTGAATAAAG TCATCCTCAT AAATATCATT AATGATTGGC TTTGCAGCTT GGTACAATTT 6 00 

TTGTGAAAAT TCCATTGTAA AAAATCCTCC CTAAATAAAA AAACTACTTC CAACATGAAA 660 

GTAGTTTGAT GGCAATGTTG CTATACTAGC CCCATCACTT CAATAACTAC TTTCCTACGT 720 

TGGTACTAAC CAAATCAGGT CATAAGGGTC TGAACAATTC ATCTCAGCCA TATCATTAGG 780 

CTCCCCTAGT AGTTCCTTAG TATTCAATTG CAAATTAATC TTAGCAAACG GTTTCAACAT 84 0 

TTTCAATTAT TGTTGCTCAG TTGTATTATT ATCTTTAAAT AATAATTCTA TAATGACATA 900 

TATTTGCGAA ATAAAAAAAC CGGAACATAT CGAGAATTCC CCGATATATT CCAATCTAAA 960 

AGTTACTTAT ATAACTATTA ATTAGCTATG CATAAATGGC TTATGCAGTA ACCCAATGTC 1020 



25 



30 



45 



50 



55 
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5 



15 



TATCCAGAAG 




ATAGTACAAA 


TAAAATCTAC 


CAAACTAGGT 


840 


TTGAATGATT 


TAGTCCGTAT AriT a r* i a r^rvr* 


AAAACGATTA 


GGGGTATAAA 


CAATGTAATT 


900 


GTTAAGCAAG 


ATGTAACGCrr 'TW3m&r"n*p 


AATCGAGAAC 


AACGATATAT GAAAAAAGTT 


960 


AATACTGCAG 


CTAACTATGT TTrTW^ a tt^ k 


AATGATGTTA 


ACCTTTCTAA 


TCCTAGTAAA 


1020 


GCGGCAGAAA 


■'^ ivjAavvj i V, X AAAGTAGCG 


TCAATAGCTA 


AATCAACACT 


CGATTTGATG 


1080 


AGTAGAACTG 


ATTTGATTGA AGATAAACAA 


CAGAAGGTAA 


GCTCTAAAAC 


TGTGACTACA 


1140 


TCTGACGGCA 


CTATCGTTCA TGATTTTATa 


GATaAATCmA 


ACATTAAaGA 


TGTAAAAmCX; 


1200 


aTTGGAACGa 


TTGGCGATtC TGTAGCTAGA 


GGATCACATG 






1240 


(2) IbTFORMATION FOR SEQ ID NO: 494: 









(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


494 : 








ACGGTGGATT 


TAGCCTAGAA 


GATTTAACGC 


ATCAAGGTAa 


ATTaTCAGCA 


TTTAGCTTTA 


60 




ATGATCAAaC 


AGGTCAAGCA 


ACATTGATTA 


CTAATGAAGA 


TGAAAACTTC 


GTAAAAGATG 


120 


30 


AGCAACGTGC 


TGGCGTAGAT 


GCAAATTATT 


ACGCTAAACA 


AACATATGAT 


TATTACAAAG 


180 




ACACATTTGG 


TCGTGAATCA 


TATGaCAACC 


AAGGTAGTCC 


AATTGTTTCA 


TTAACGCATG 


240 




TTAATAACTA 


CGGTGGTCAA 


GATAACAGAA 


ATAATGCCGC 


ATGGATCGGT 


GACAAAATGA 


300 


35 


TCTATGGTGA 


TGGTGATGGT 


CGCACATTCA 


CAAGTTTATC 


GGGTGCAAAT 


GACGTAGTAG 


360 




CACACGAATT 


aACACACGGT 


GTGACACAAG 


AGACAGCGAA 


CTTAGAATAT 


AAGGACCAGT 


420 


40 


CAGGCGCTCT AAATGAAAGC 


TTTTCAGATG 


TTTTTGGATA 


CTTTGTAGAT 


GACGAGGATT 


480 


TCTTAATGGG 


TGAAGATGTC 


TACACACCTG 


GAAAAGAGGG 


AGACGCTTTA 


CGCAGCATGT 


540 




CAAACCCAGA 


ACAATTTGGT 


CAACCAGCTC 


ATATGAAAGA 


CTATGTATTC 


ACTGAA7VAAG 


600 


45 


ATAATGGTGG 


CGTACATACG 


AATTCTGGAA 


TTCCAAATAA AGCAGCTTAT 


AACGTGATTC 


660 




AAGCAATAGG 


GAAATCTAAA 


TCAGAACAAA 


TTTACTACCG 


AGCATTAACG 


GAATACTTAA 


720 




CAAGTAATTC 


AAACTTCAAA 


GATTGTAAAG 


ATGCATTATA 


CCAAGCGGCT 


AAAGATTTAT 


780 


50 


ATGACgAGCA AACAGCTGAA 


CAGGTGTATG 


AAGCATGGAA 


TGAAGTAGGC 


GTGGAGTAAA 


840 




AATATATAAA 


CaAGAAGAAG 


TAATGTTAAA 


CACTTATAAA 


TAATTAAATT 


TTAAATACAG 


900 
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TTATTGACAT TGTTTTTATC CAAAATTCAT TGTTAAGACA TTTTCTTTAT GAAATAATAA 120 

TnATATTGAA GTATATTTTT ATTATTATTA AAAATAAATA AGGGGATACT TATGAGCACA 180 

AATCAAACAT TTTTAATATT TGTTATAGCA ATTATTCTAC TTACATCTGT AATAGGAATT 240 

GTTGGACGAT ACATGAGTCG TCAACGTCTA TTAAAATCTA TGGAAACATT ATGGCAAACG 300 

ATTTCTCCAT TAGAAGCTTT TATCAGACCG AACTCACATT TCGACTATGA GTATAAGCTC 360 

TACAAGGAAA AATTTGAATC ACATTCATTA GTTGATGATA AAACTTGGTC CGACTTAAAT 420 

ATGAATGCAA TCTTTCATAA GATGAATTAT AATTTAACAG CTATTGGTGA AATGAAGCTA 4 80 

TATGCCTGTT TACGTGGAAT GCTTTCAATT ACGAACAAAT CATTACTTAG TTTATTTAAT 54 0 

GATAATGCTG AATTTAGAAA AAACGTAACA TATCATTTAG CTTTGATTGG tAAAACTGTT 600 

aTCCAACATT TCCAGACCAA ATCACACCGG kAAACGTCCA AATATATTGn TCTATGCCCG 660 

GTTTACCAGT ATC 673 
(2) INFORMATION FOR SEQ ID NO: 493: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 

AAAAAAAGTT AATGCTGACG GTGTATTAAC TTTTGATATT CTAGAAAATA J^TATACTTA 60 

TGAAGTTATT AACGCTATAG GGAAAAGATG GATTGTTAGT CATGTCGAAG GTGAAAACGA 120 

35 CAAGAAAGAA TATGTAATAA CTGTCATTGA TAGGAAATCA GAAGGCGACA GACAACTGGT 180 

TGAATGTACT GCTAGAGAGA TTCCCATAGA CAAGTTAATG ATTGATAGAA TTTATGTTAA 240 

TXTTAACAGGA TCTTTTACAG TAGAAAGATA TTTTAACATT GTGTTTCAAG GTACTGGAAT 300 

GCTTTTTGAA GTCGAGGGCA AAGTTAAATC TTCAAAGTTT GAAAATGGTG GTGAAGGCGA 360 

TACAAGGTTA GAAATGTTTA AAAAGGGATT AGAACATTTC GGTTTAGAAT ATAAAATAAC 420 

GTATOACAAA AAGAAAGACA GATATAAGTT TGTATTGACG CCTTTTGCAA ATCAAAAAGC 480 

GTCTTATTTT ATTTCTGACG AAtCAACGCC AACGCTATAA AACTCGAGGA AGATGCAAGT 54 0 

GATTTCGCCA CCTTCATTAG AGGATATGGT AATTATTCAG GAGAaGAAAC ATTCGAACAC 600 

GCTGGGCTCG TAATGGAAGC TAGAAGTGCA TTAGCTGAAA TATACGGCGA CATCCACGCA 660 

GAACCATTTA AAGATGGTAA AGTGACTGAC CAAGAAACTA TGGATAAAGA ATTACAATCG 720 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 91: 
TATCCACCCC ACGAnAGCCC CGGAAACTTA TTGTGTTACA AGATATATAA GCAGAAACGA 
S ACAACAGTTA ACAAAATAAA TGAAATTAAA CGTTTTAAAA ATGAAACAAA TGAAATCATC 

TATTAGGTTA TGAAACTGTT TATAGCTTGA ATAGAAGCAT TTATTTTTTA GGAGGACAAT 
TATTATGCGT CAAACATTTA TGGCAAATGA ATCAAACATT GAGCGCAAAT GGTATGTTAT 

10 

CGATGCTGAA GGCCAAACAT TAGGTCGTTT ATCATCAGAA GTAGCATCTA TCTTACGCGG 
TAAAAATAAA GTAACTTACA CACCACACGT TCATACTGGT GATTATGTAA TCGTTATTAA 
TGCATCAAAA ATCGAATTTA CTGGTAACAA AGAAACTGAC AAAGOTTACT ACCGTCACTC 
AAATCACCCA GGTGGTATCA AATCAATCAC TGCTGGTGAA TTAAGAAGAA CTAACCCAGA 
ACGTTTAATT GAAAACTCAA TTAAAGGTAT GTTACCAAGC ACTCGTTTAG GCGAAAAACA 
AGGTAAAAAA TTATTTGTAT ATGGTGGCGC TGAACATCCA CACGCTGCAC AACAACCAGA 
AAACTACGAA TTACGTGGTT AATTAGAAGG AGGAAATGAC TTrGGCACAA GTTCAATATA 
GAGGCACAGG CCGTCGTAAA AACTCwGtAG CACGTGTACG TTTaGTACCa GGTGAAGGTA 
ACATCACAGT TAATAACCGT GACGTACGCG AATACTTACC ATTCGAATCA TTAATTTTAG 
ACTTAAACCA ACCATTTGAT GTAACTGAAa CTAAAGGTAa CTATGATGTT TTAGTTAACG 
TTCATGGTGG TGnTTCACTG GACAAGCTCA AGCTATCCGT CACGGAATCG CTCGTGCATT 
30 ATTAGAAGCA GATCCTGAAT ACAGAGGTTC TTTAAAACGC GCTGGATTAC TTACTCGTGA 

CCCACGTATG AAAGAACATA AAAAACCAGG TCTTAAAGCA GCTCGTCGTT CACCTCAATT 
CTCAAAACGT TAATTGTCGG ACGATATATA CAAAACACCT CGATAITATG TCGAGGTGTT 
TTTTTGGCGT TTTTGCGGCG AATATGGAAT GTCTAGAATA TAAATCAATT TTTACCTTCC 
CACCATAAAA GATGAAGAAC CATGAATGTG GAGAACAATA AATAGITCGA TATTCTGTTA 
TTTTTTTGGA AGTGGAAGTG GATTTGGAAT ACTTTACTCn AAACGATTAA AAGGTrTAAA 
AAAACAACAA AnAGAAA 
(2) INFORMATION FOR SEQ ID NO: 492: 

(i) SEQUENCE CHARACTERISTICS: 
4S (A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



35 



40 



SO 



SS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 
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ACTCAAGATT TGACAATCTT TTTATACTTA GCTTTTGTTG AATCATCCAT GGCATCACTT 900 

AATCTCAACA ATGATTTCAT TACTGTTGCA GATGCTGAGT GACTGGTTTC ATTTTCACGA 960 

5 CTGATAGCrC TACCTCGTGA TAAATCCATC ATTTCACCTT TATAAATGAG TGGCATAAAT 1020 

CCGTCGTCAA TCCATGACTT TAAGGTTGTA TCATTTTGGG TTTTATCATT AAAAGGTGTT 1080 

TCTTTTATCA TCGGCATCAT TTGAGAAATC CCCTCTAAGA GTACAACGCC ATAAGCACCA 1140 

GT6TATGGAA CXyTCTTGATG ATCAATGTAA GAGCCATCTT TATAA/^TCC ATTACGTTCT 1200 

TTACCAGTGG CAGAATCTTG AACGTAAGTG AAGACTTTAT TAAATGAATC TATAGACTTT 126 0 

TTCATCATAT CTTTATCTTC TTCGATAATA CATTCTAAAA GTTTCACCTT AGAAATGTCT 1320 

ACTAnATTTC CGCCTTTAGC AAGTTCAGnT TTTCCTACAC AAGATAATAT TT 1372 

(2) INFORMATION FOR SEQ ID NO: 490: 

<i) SEQUENCE CHARACTERISTICS: 
20 <A) LENGTH: 564 base pairs 

<B) TYPE: nucleic acid 
< C ) STRANDEDNE S S : doub 1 e 
(D) TOPOLOGY: linear 



25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 90: 







AGTAAAGCGT 


CGTTAATTAA 


ACAATGGGAC 


CAATTTGTTC 


GTCTTATATA 


60 


30 


AATGTACTTT 


aCCTTTATTT 


TTACAAAAAT 


AGCATTTTCC 


TATGTCATTT 


AACTAAACAT 


120 




GTAAGTTCGT 


ATGAACGAGG 


TTTGTTAAAT 


AGATGATTCT 


AGGAAAATGC 


TTTTTTCTTT 


180 




TGACTTAGTT 


TAAAATATTT 


TGCCACTTTG 


TACTGATAGT 


AGTTGCATTG 


TACTGTTGTG 


240 


35 


CAGATTCTAT 


GCTATTAATT 


GAAAACTGTT 


GCAArrriTG 


AGTATTATTT 


AGTAATTGGT 


300 




CGACCTTTTC 


AACCATTTGA 


TTGATGTCAC 


CTTGAGGTAC 


TAAATAGCCA 


TTAAATCCAT 


360 




CTTGaATCAG 


TTCTGaTGGA 


CCATAATCTA 


CATCATAACT 


GATCACTGGT 


GTACCTACTG 


420 


40 


AAAgCGACTC 


TAAAATTGCT 


AAGCCAAAAC 


CTTCCATTTT 


ACTTGTCGAT 


AACATCAGTT 


460 




CTGCTTTAGC 


AATCTCTTCA 


TTAATATGCG 


TCTTAAAACC 


ATGAAATTTA 


ACATGTTCCA 


54 0 




GATATnATGA 


TAATCTTCTA 


CAAG 








564 


45 


(2) INFORMATION FOR SEQ ID NO: 491: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 88: 



5 


GTAGGCACAC ATCTGCCATA 


TAAACATTCT 


TTTATACTAG 


TGTTCTCATA 


TAGTGTAGAG 


60 




TTATAGTCTC CTTCTTGAAT 


CTCGAATAAT 


TCAATCAACC 


TATCAACCTT 


AGTCTCTTCC 


120 




GTTACTTCTT TTTCAATATC 


AACTATGAAG 


GGGATATCAA 


TTGGAATAAA ACTTGACGTC 


180 


10 


GAACACTTAT TTGTATTTGG 


ATGAAAACGA 


ACGAATCCAT 


CACTAAATCC 


TGTTGAAAAA 


240 




AATATTTTTC CTTGTGATAG 


ATCCGGATTT 


TCTCGCGCCC 


ATTTAATTAA 


TTCATCTAAT 


300 




CTCATTTCTT TTTTAACTTT 


GATTTTCATT 


GTTATATCTC 


CTCTTGAACA 


GTAAATTTAT 


360 


15 


CGTTAAtTGA TACGTATCCA 


GTCACATTAC 


ATAAGATGCT 


ATCAACATCA 


AAAGTCACAC 


420 




AACAGTTGCG TTCAACATCA 


TTTGAATAGA 


ATCT 






454 




(2) INFORMATION FOR SEQ ID NO: 4 89: 







(l) SEQUENCE^CHARACTERISTICS : 

(A) LENGTH: 13 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


489 : 






30 


TTGTCAGAAT 


TAGAATGCTT 


TTGAGTTACT 


TCATAATACT 


CATCAGTTTT 


TTGTGTATCC 


60 




TTTTGACTTT 


TATTTATTTC 


TTTCCACTTA 


CCAGTATGAC 


TTTCTTTTTT 


TACAGTTATT 


120 




TTCGGTTTGT 


TTAAAAAATG 


ATAACCGATG 


TTCTTTTTGG 


TATCTGTGGA 


CTCTAAAAAG 


180 


35 


ACTGAATTGT 


TTTCCTGATT 


ATCAGAATTG 


GTTGTTTGTT 


TATCGTCTGT 


ATATAACGTA 


240 




TACCCATTCG 


CTTTGCGATT 


TTCAATCGTT 


GTAACTGGAT 


TCTTTGATGA 


ATCAGTACTT 


300 




TTAATGCCAG 


TTCCTAAGAA 


GACAATTTTA 


TCGTTTAATA 


TGAAATATGA 


TTrrrrcGCA 


360 


40 


GTTAAAGTTT 


TGTCCTGATT TTCAAAATCC 


ATTCCGATAC 


TAGCATGTTG 


GTCATCAACT 


420 




TTTGTTCCGC 


CAACAAAAGT 


TTTACTCGAC 


TTTTTATCAT 


CCGTATCTTT 


TAATATTTCA 


480 




TTGTCTAAAG 


TTGTTGTACC 


TGATAAACGT 


TTCATATCGG 


CTGTCACCCA 


GAAGTTATCA 


54 0 


45 


TGATAGTGTT 


TGACATCGCT 


GTTATATAAA 


TAAGACATTC 


CAGCACCAGT 


GTGCCAACCT 


600 




TTTAAATTCT 


CTCCGTTGAT 


ACTTTCATAG 


CGTGCTACGT 


TTTTCGACGT 


CATACTTAAA 


660 


50 


CCAAATGCAA 


AGTCTAAGTC 


TTTGTTATGA 


TAGGTGACAC 


GATCCATGTC 


ATTATATATT 


720 


TTAAGTTGTT 


GTGTTAATCC 


GTTTTTAGAA 


ATACTGTTAT 


CTGTCATTAA 


AGACTTCATT 


780 
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TATTATTATG TTCAAAACTT TACGCTCCAA AAAGTAAAAA GGAAGTTAAG CAATGTTTAG 660 

TTGCTTAaCT TCGGaTATTG AACGCATCAG TCCAATTTGA CATAGAGCCT TTTTTAGTTC 720 

S TTGATGTTTC TCTTTAAAAC CTTGCATATT TTACAAAAAG AAAAATTAGC AGTATAATTA 7 60 

AGACAACGAA AATAAGTATT TACTTATACA CCAATCCCCT CACTATTTGC GGTAGTGAGG 64 0 

GGATTTTTAT TGGTGCGGCT ATATGTCACC TATTTTGTAT TGCGTCTACT TAGCCAATAA 900 

GAAAAAAACG CAATGGCACA GCCACTGATG ACTGGTGCTA TGATGTGAAC GaAAATAAGC 960 

ATCACCTTAT ACACCTCCTC TCTGCGTCTA AATTGACGsC TGAGaGrTAG GcGACTCTAC 1020 

TATTATATCA TCGGCAAATA TACAAGCACA GTCACTTGCT TCTGATAAGT TATATGATTC 1080 

TAGCTGATAG ATTGAATCGT CTACACTTAA TTGGACAAAT TCTATGAGAA TAGATATTGT 114 0 

TAATTTAAGA AAGTAGGCGA TTTTATTATG ACAAGAGAAA GAAGATCATT TAGTTCAGAG 1200 

TTTAAGTTAC AAATGGTTAG ATTATATAAA AATGGTAAGC CTAGGAATGA AATTATACGC 126 0 

GAGTATGATT TCACACCTTC GACGTTTGTA AATGGCGGTT ATAAAATGTA GGAAAATGGA 13 20 

TAAAGCAACA TCAAAACACG GGTACATTCA ATCACCAAGA TAACTTATCG GATGAAGAAA 13 8 0 

AAGAGCTGAT TAAATTACGC AAAGAAGTTC AACATTTAAA AATGGAGAAC GATATTTTAA 144 0 

AGCAAGTAGC GCTGATTATG GGGCAAAAAT AGAAGTCATT CAAAAGAATG CACATCAATA 150 0 

TTCAGTATCA GCAATGTGTA AAGTCCTGAT AATACTAAGA AGTACCTATT ATGATTCTAT 1560 

AAAAAGAAAA GATAATAAAA TCACTAAAGA TGATTCAAAC ATAGAACATG CCGTCATAAA 1620 

TATTTTTAAT TCTAATAGAA AAGTCTTTGG TACAAGACGA ATTAAAAATC ATTTAAATGA 1680 

CAAGGGTCTC ACTGTATCTG GACAAAAGAT AGGTCGATCA TGAAAAAATC TAGTTTCTGT 174 0 

35 TTATACGAAA TCTAAATACA AAAATCATCT AAAAGAAACT AATGAAAAAC GAATTAAAAA 1800 

TCTTTATTAT TAGCTGCTGG TGTATTATTT GTTAGTCCAA TTTCATTATC TTATAATTCA 1860 

GATGTAGCTC ATGCTGAAGA TAAGTTAGAC CATTCTCAAG CAAAGGTAAT ATATTTGAGT 1920 

AACCAAAATT TATTTGATGA ACTTGAGAAA AAAGGTTATA AACTGGAAGA TATATTTACA 1980 

AAAGAAGAAA TAAAAAAATA TAAAGCTGAA GACCAATTGA GAGCGGGTAA AACTCAATAT 2040 

GTAGAAACAG GTAAAGATAC TGCAACATTA TATCTTTCTT CTGCATATAC AAAAACAATA 2100 

GCTGCTTTAG GT 2112 
(2) INFORMATION FOR SEQ ID NO: 4 88: 



25 



30 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



55 



141S 
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EP 0 786 519 A2 



10 



15 



20 



2S 



30 



AATGTAAAAT GTTCAAAGTG AATCATATTA CCAGCCGAAT 
AAGAATATTG GGATGTATAA AATCGCTAAA AGTATCCCX3A 
TTCATATGAT TCACCCTCTC CCATTAGATG ATTTrGTAAT 
ATACAATTAA GAATATAGCT ATAGTTGATC CCATACCATA 
GTTCCTCTAT T6CCGTACCT ATATTTATGA CTTTATTACC 
TAAATAATGA AAGTGATGGA ATAAAGGTTA CTTGAATCCC 
TTAACGGCAT GATTACTTTT CTAAAAGTAT AGAAAGGACT 
CCTGCAATAA ATTATTAGGA ATTGCriTCA TGCTATTAAA 
GTATATAAAT GTAACTTGCC ACTACTAAAA ACGCACCAGT 
ATGGTAAATT AAATAAGTGG AAAATTGA7T AATCACGCCA 
AAAAGCATAT GTCTTTAACA ATAAATTTAT CCATGTTGGA 

'>«*xxv,v^>^v. IjAUTAATATA ATAGGCAGnT 

AATAATTGTT ATTGAAGCGG CATATAAAAT TGAATATGCA 
AGTAAAAATT 

(2) INFORMATION FOR SEQ ID NO: 487: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 2112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 



TGAATGAATA 
TATACAGCTT 
GATTAAAATG 
ATTTTGAATT 
TGCAATTAAT 
AGTCATAACA 
GGCACCTAAA 
TATAGGTAAA 
TGTAAATAAC 
TCATGACTTA 
ATAATCATTA 
GGATAACTGA 
AACATTTTCA 



GAACATTAAA 
TCCATACCAT 
AATGCCATAA 
GTTAAAAATT 
CTTGTAATCA 
CCTTCTTTTG 
TCACTTGAGG 
ATCATAAATG 
AAATTGAATG 
ATAAACCTAT 
TCATTAATAA 
TAGTCAAGGT 
AATATTTTGT 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1690 



i3i> 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


487: 








ACGAAAAGAA 


ATATTATGAT GAACAAAAAG 


AAAGAATAAC 


GATTTATATG 


AAGTACAATG 


60 


40 


TGAAAGGTTA 


TAAAAATATA AGCTTCGCTA 


ATTTTAAAGA AAACCCAATG 


GATGGTTATT 


120 




CTATTAGTGG 


TTATATAAAT AAtGaTAAAA 


AGTTATCATT 


TACAGCTGGT 


ATAAGATCTG 


180 




TTGATGATTT 


TCAATTTGAT ACCGATATTT 


CTTATACAGA 


TGAATTGGGT 


AGAAAATTTA 


240 


45 


ATAAAAATCC 


TAAGTCAGTT TCTGAAATAA 


AAAAAGAGCA 


AAATACGTCC 


AATAAATAAT 


300 




TGTTCATATT 


GTGATGAAAC AAAAATATAA 


GTCATTAGAT 


GAGTTTAACT 


ATGTTATAAA 


360 




TATTTGTAGT 


ATCTATAAAA ATCTCGACAC 


TATTAAAATG 


ATAAAGTGCC 


GAGGTTTTCT 


420 


50 


TACTTATTTA 


GTTAATTCAA AGTTTATGCC 


AGATTCATAA 


GAATTTGTGA 


CACTTTTAAT 


460 




AGTGTAcCAT 


TGATTATTAC AATTTATCAA ATGGTCCTTT 


AGAAGGTATA AATAACAAAA 


540 
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AGTCCACATG ACTAATTAAG TTTTGTGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 1800 

TTTTGTAATT AAAATAAAAA TTTGCTATAG TTATTCATGT ATTTAAAAGG TTGGGGATTA 1860 

GCATAATGGG ATTGTCCTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920 

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAGAACG TACATCTCCT 1980 

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT CACTCAnAAA 2040 

GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT TGITCTAGCT 2100 

ATTATGGAAG CACAAATGGA nAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 2160 

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 2220 

15 2224 
ATTT 

(2) INFORMATION FOR SEQ ID NO: 486: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 



30 



60 
120 
180 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 
ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTTCTT TTTCAAATTT 
CTTAATTAAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 
ATGCACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 

TAATGCACCT ATAATGAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 24 0 

35 TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300 

GCAAACAATA ATOTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 360 

TAAACTTCAA CTCATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 42C 

'W' TCGTCTAGTG AATAAGTTAA AGCCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 48( 

GGAATAATAA TGTTGCTTAA TAATTGTGGT TCAGTCGCTC CTAAATCTCT TGCAGCATTT 54 ( 

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 60< 

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 6 6i 

CCAATCGTTG TAAACATAAT TAAGAATGAT GCACCTATGA CAACGTCGGA TCATACCATC 72 
AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 
GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 



SO 



780 
840 



55 



1413 



;OCI0: <EP 0786519A2..I.> 



10 



IS 



30 



35 



40 



AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 1440 



ATCACTTATA ACGATTTAGA TGAAGCGATT CAAATTGCAA ATGATACAAA ATATGGTTTG 
GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 
50 GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 

AAACAATCTG GriTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 
ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATGA CGTTPAAACA 
TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTAITT GCGATGATGT GGGAATAGTT 
ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 
ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 
TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAGCAA 
TACATTAATG GCGAATGGGT AGAAAGTAAT AGTAATGAAA CGATAGAAGT TATAAATCCA 
GCAACCGAAG AAGTAATCGG GAAAGTTGCT AAAGGTAATA AAGCTGATGT TGATAAAGCC 
GTCGAGGCGG CAGACGATGT TTAITTAGAG TTCCGTCATA CATCTGTCAA AQAAAGACAA 
GCGTTATTAG ATAAAATTGT AAAAGAmTAT GAAAACAGAA AAGACGATAT TGTACAAGCT 
ATTACGGATG AATTAGGTGC TCCTTTATCA TTATCTGAGC GTGTCCATTA TCAAATGGGA 600 
CTAAAGGATT TTGTTGCAGC GAGAGACGCA TTAGATAACT ACGAATTTGA AQAACGCCGC 660 
GGAGATGATT TAGTTGTTAA AGAAGCAATC GGTGTATCTG GATTAATTAC ACCGTGGAAC 
25 TTCCCTACAA ACCAAACATC ATTAAAATTA GCAGCAGCAT TTGCGGCTGG TAGTCCAGTT 

GTACTTAAAC CATCTGAAGA AACACCATTT GCAGCTGTTA TTTTAGCTGA GATTTTTGAT 
AAAGTCGGTG TTCCTAAAGG TGTATTTAAC CTTGTTAATG GTGATGGTGC TGGTGTTGGG 
AATCCTTTAT CTGAACATCC TAAAGTACGC ATGATGTCAT TTACAGGATC AGGCCCTACT 
GGTTCTAAAA TTATGGAAAA AGCCGCTAAA GATTTTAAAA AGGTATCATT AGAGCTTGGT 
GGCAAATCAC CATATATCGT CCTAGATGAC GTAGATATTA AAGAAGCGGc TAAAGCAACa 
aCAGGCAAAG TTGTTAATAA TACTGGTCAA GTATGTACAG CTGGTACACG TCTTTTACTG 
CCTAACAAAA TTAAAGATGC ATTCTTAGCT GAATTAAAAG AACAATTTAG CCAAGTGCGT 
GTCGGTAATC CAAGAGAAGA TGGTACACAA GTAGGCCCTA TCATTAGTAA AAAACAATTT 
GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTGAATT ATTTTATGGT 
GGTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CACGTCCGAC AATTTTTATT 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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1560 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 

10 



20 



30 



45 



ACAACCAATT 


TTACTAAACA 


TGGTrrTCAT gCATTTTCAA TATTTAGATG 


TCGACCGCAT 


60 


AATCGAAGAA 


TCGCCGACAA 


X X n\m, X X A X 


1 V3/\u X X n 




ATATTTCTAG 


120 


AGATCGTCAT 


GAGAAACGAT 


t\ X A X OVax^ X A X 


X u/\rkv7rl/%/\ X X 


TTAAATHATG 

X XAfVVX\MAX\9 


GTATCGATGT 


180 


TCATACCACT 


TTGaACATTC 


OldV—AX AX XWA 


/vIVj 1 X X Anvi i- 


AfJTrJVAATTG 


AACTGATGAC 


240 


CGGTGTACAT 


GTTAAAGAAC 


V3 X\J X^^^^^VjA. 


^ X AX X X Iv^iXxi 


ATGAfSPGPr'fl 
t\ X VJnU^u^^ v9 


ATGTATTAGA 


300 


AGTCGTAGAT 


ATCTCACCTG 


2k n r* zv jiT*r a a T 


X/vviv.U^'X X/\ 


A A AG fTTGGCA 


AGGTATATaA 


360 


AAAGGATAGG 


CTAGATGTAG 


V»x\X X XnOX/vn 


1 X 1 V« X X X 


TATGPr*PAPP 

X A XOVv%vV.-n\vV— 


TAAGCGAAtG 


420 


CGTACATTGA 


CGTTAAGAAC 


X X V^O/\V_ 


1 X X VJ X 


ATAAACAAAA 


AGTCCGACAC 


480 


AACCATAAAA 


CGTCACTCAA 


X X A X X 


nnrrmnpAA 

VJ ^ X VJ X V>-nn 


TTAGIVSGGAG 


CATTTATAAT 


540 


GAAGCAGTAA 


TTAAAGAGGC 


X X^\MiX^X.^^X X 


GCTCAAAAAG 


AACATGCGAa 


GTTCACTGCT 


600 


ATTTATATAG 


ATGTATTCGA 


AAAAAACAGG 


CAATATAAAG 


ATAGTCAAAA 


GCAAGTGCAT 


660 


CAACATCTCA 


TGCTTGCAAA 


ATCATTAGGA 


GCAAAAGTAA 


AAGTAGTTTA 


TAGCCAAACC 


720 


GTTGCATTAG 


GATTAGACGA 


ATGGTGTAAA 


AATCAAGATG 


TAACCAAATT 


AATTATCGGA 


780 


CAACATATTA 


GAAATAAGTG 


GCGAGACTTT 


TTCAATACAC 


CTTTAATTGA 


CCATTTAATG 


840 


TCCTTTGAAC 


ATAGCTATAA 


AATCGAAATC 


GTTCCAATCA 


AACAAATACC 


TGTTGAATTG 


900 


AAAATGAACA 


AATCACCCTA 


TCGTCCTAAA 


GGCAAACGTT 


TCGCCATAGA 


TATGTTAAAA 


960 


ATGATTTTGA 


TTCAAATAAT 


TTGTGTAATG 


ATGGGACTGT 


GGATTTATCA 


ACTTGATAAG 


1020 


CATGAGTCTA 


GTACGATTAT 


TTTAATGATT 


TTTCTCATCG 


GCATCATTTT 


ATTATCCATT 


1080 


TGGACGCGGT 


CCTTCATCAT 


TGGCTTTTaG 


CAGCAATTAt 


TAACGTATTT 


GTgTkTAATT 


1140 


ATkTTTTtAC 


GGAACCTA 










1158 



(2) INFORMATION FOR SEQ ID NO: 4B5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 
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TTACTTGTXSA TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 
TTGCACCAGA ACATTTACmA CAAGTATTTg ATCGTTnTA TAAAGTTGAT GCAGCGnAnA 
ACCCCGnGGT AACCAnGTA 

(2) INFORMATION FOR SEQ ID NO: 483: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 
GAGCTGTTGT TACTTTGATG CCTGCAGCTT TATTACGGCT GACTTXSGTAA TCATAAGTTT 
CAGCATATTG CTCAATATAT GCTATATCAT ATTGAATGGT ACGAGGTGAT ACACCAAGTT 
GATTAGCAAT GGTATTGATT GGAATAAACG TTTGCTCATG AATTAAAAGA TACAAAATTT 
CGATTTGTCT ATAACTTAAC AACGTAATAT CCTCCTATTT GTAATTCTAA GCGATTTCTT 
AAAAACGTAG ATATGCAATC TCTTTCATAT TTTAATCCGA AAAATTCCAT ATCAAAATGT 
TTATGGCGCA AGATTTTATA GGAACTTTTA AAATAAATTA rATATTCATG TTGACAATTT 
AAAAATGTCG CAGTATATTT AGTTAGACAT CTAACGAAAT GGTCGTCCAA TAAATGGAAT 
TCACTTATTC GTATTTATTT AGAATGATTA GTCATGAGAT GAAACAAAAG GCTGATCAAA 
AGTTAGAGCA ATTTGATATT ACAAATGAGC AAGGTCATAC GTTAGGTTAT CTTTATGCAC 
^ ATCAACAAGA TGGACTGACA CAAAATGATa TTGcTAAAGC ATTACAACGA ACAGGTCCAA 

CTGTCAGTAA TTTATTAAGG AACCTTGAAC GTAAAAAGCT GATCTATCGC TATGTCGATC 
CACAAGATAC GAGAAGAAAG AATATAGGGC TGACTACCTC TCGGATTAAA CTCGTAGAAG 
CATTCACTTC GATATTTGAT GAAATGGAAC AAACACTCGT ATCGCAG1TA TCTOAAGAAG 
AAAATGAACA AATGAAAGCA AACTTAACTA AAATGTTATC TAGTTTACAA TAAATGATAA 
GTGTGACTGG TAGAAATCAG TCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTCTT 
AGTAAGCTAA TTATTGGAAA AGACAAGGAG TATTOAACAA TCAAAGACGA ACAATTATAT 
TATTTTGAGA AATCGCCAGT ATTTAAAGCG ATGATGCATT TCTCATTGCC AATGATCATA 
GGGACTTTAT TAAGCGTTAT TTATGGCATA TTAAATATTT ACTTTATAGG ATTtTyAGAm 
^ GAyAGCCACA TGATTTCTGC tAatCTCTCT AACACTGCCA GTATTTGCTA TCTTAATGGG 



GTTA 
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 
AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 
AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 
TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 
AGTTACAAAG TGAAGATGAG GTGGGAATAA TGAGTCA 
(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dotlble 

(D) TOPOLOGY: linear 



1500 
1560 
1620 
1680 
1717 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 482: 
GTAAATCTGT TACTCGAAAT GTAACGATTA AAGAAAAGGG CTCATCTCAA ACATATATTT 
TGTTAGGCTA TCCAACAAAA GCACAGAAGA ATAGTCATAG CAAATATAGT GGAGTCTTTA 
TATATAAAGA CTTGAAATCA ATCGAAGATA CAAATAATGC TATTACGATT ATCACCATAA 
TTACGGCTGT TATTTTCTTA ACAATTACAA CAGTCTTTGC GTTTTTCTTA TCGTCAAGAA 
TTACAAAACC TTTAAGACGT TTAAGAGACC TAGCTACACG TGTATCTGAA GGGGATTACT 
CTTATAAACC TTCTGTCACA ACGAAAGATG AAATTGGTCA ATTATCGCAG GCATTTAATC 
AGATGAGTAC AGAAATCGAA GAGCATGTCG ACGCATTATC CACATCTAAA AATATTAGAG 
ACAGCTTAAT TAACTCTATG GTAGAAGGTG TCCTAGGTAT TAATGAGAGT CGACaAATTA 
TCTTATCTAA TAAGATGGCG AATGATATTA TGGACAATAT TGATGAAGAT GCTAAAGCTT 
TCTTATTAAG ACAAATAGAA GATACTTTTA AATCAAAACA AACTGAAATG CGCGATTTAG 
AAaTGAATGC ACGATTCTTT GTTGTGACCA CAAGCTATAT CGACAAGATT GAACAGGGAG 
GTAaAAGTGG TGTTGTTGTG ACAGTTCGTG ATATGACTAA TGAGCACAAT CTAGATCAAA 
TGAAGAAAGA TTTCATTGCT AATGTATCAC ATGAATTACG TACACCGATA TCATTACTTC 
AAGGTTATAC TGAATCAATT GTAGATGGTA TTGTTACAGA ACCGGATGAA ATAAAAGAAT 
CGCTTGCCAT TGTCCTTGAT GAATCGAAAC GTTTAAATCG TTTAGTTAAT GAATTGTTAA 
AIOTCGCACG CATGGATGCT GAAGGGTTAT CCGTAAATAA AGAAGTTCAG CCTATTGCAG 
CGTTACTAGA TAAGATGAAA ATTAAGTATC GCCAACAAGC TGATGATTTA GGTCTAAATA 
TGACTTTTAA TTAyTGTAAG AAGCGTGTTT GGAGTTATGA TAwGGATCGC ATGGACCAAG 



60 
120 
180 
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960 
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(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 
AGTTGCTACA CCAGACATGA TGGGTGAAGT TGGTAAATTA GGTCGTGTAT TAGGACCAAA 
AGGTTTAATG CCAAACCCTA AAACTGGAAC TGTAACAATG GATGITAAAA AAGCTGTTCA 
AGAAATCAAA GCTGGTAAAG TAGAATATCG TGCTGAAAAA GCTCGTATCG TACATGCATC 
AATTGGTAAA GTTTCATITA CTGATGAACA ATTAATtGAA AACyTCaATA CyTTACAAGA 
20 TGTATTAGCT AAAGCTAAAC CATCATCTGC TAAAGGTACA TACTTCAAAT CTGTTGCTGT 

AACTACAACA ATGGGTCCTG GAGTTAAAAT TGATACTGCA AGTTTCAAAT AATAAATGAT 
ATAAACAATT ACAGGCTGAA AGAAATATCT TTCAGTCTGT AAAAATATAT TGACAATAAG 
TAATTTCCAA GTTATATTAC TTATTGTGAT TATTTTACCT AAGACAGTAG GAGTTATTTA 
TAACTTAAAA TTTATCCTGC CGAGGCTAAA ATTCACTTGA ACGTGATGAT CTATGATCTT 
TCAAGCACTT TTTGCCGTGG GTAGAAAGTG CTTTTTTTAT TAATTTTAAA AAAAGCACCA 
AAAATTTAAA TGGAGGTGTC TGAATGTCTG CTATCATTGA AGCTAAAAAA CAACTAGTTG 
ATGAAATTGC TGAGGTACTA TCAAATTCAG TTTCAACAGT AATCGTTGAC TACCGTGGAT 
TAACAGTAGC TGAAGTTACT GACTTACGTT CACAATTACG TCAAGCTGGT GTTGAGTATA 
AAGTATACAA AAACACTATG GTACGTCGTG CAGCTGAAAA AGCTCGTATC GAAGGCTTAG 
ATGAATTCTT AACAGGTCCT ACTGCTATTG CAACTTCAAG TGAAGATGCT GTAGCTCCAG 
CGAAAGTAAT TTCTGGATTT GCTAAAGATC ATGAAGCATT AGAAATTAAA TCAGGCGTTA 
TGGAAGGCAA TGTTATTACA GCAGAAGAAG TTAAAACTGT TGGTTCATTA CCTTCACACG 
ATGGTCTTGT ATCTATGCTT TTATCAGTAT TACAAGCTCC TGTACGCAAC TrCGCTTATG 
45 CGGTTAAAGC TATTGGAGAA CAAAAAGAAG AAAACGCTGA ATAAmTTA GCGTAAAAAA 

ATTAAAAATA ATGGAGGAAT TATAAAATGG CTAATCATGA ACAAATCATT GAAGCGATTA 
AAGAAATGTC AGTATTAGAA TTAAACGACT TAGTAAAAGC AATTGAAGAA GAATTTGGTG 
TACTGCAGCT GCTCCAGTAG CAGTAGCAGG TGCAGCTX^GT GGCGCTGACG CTGCAGCAGA 
AAAAACTGAA TTTGACGTTG AGTTAACTTC AGCTGGTTCA TCTAAAATCA AAGTTGTTAA 
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GACGTTACAT ACAAAAAGGA CGAAAATGCT GTTGTCGTTG ATGCAACAAA GACTCTAAAT 4620 

GAAGAGGCAC CATATGAATA TGTTAGTAAA ATGCGTGCAA GTATTTTAGT TATGGGmCCT 4680 

CTTTTAGCAA GACTAGGACA TGCTATTGTT GCATTGCCTG GTGGTTGTGC AATTGGAAGT 4 74 0 

AGACCGATTG AGCAACACAT TAAAGGTTTT GAAGCTTTAG GCGCAGAAAT TCATCTTGAA 4 800 

AATGGTAATA TTTATGCTAA TGCTAAAGAT GGATTAAAAG GTACATCAAT TCATTTAGAT 4 860 

TTTCCAAGTG TAGGAGCAAC ACAAAATATT ATTATGGCAG CATCATTAGC TAAGGGTAAG 4 920 

ACTTTAATTG AAAATGCAGC TAAAGAACCT GAAATTGTCX; ATTTAGCAAA CTACATTAAT 4 980 

GAAATGGGTG GTAGAATTAC TGGTGCTGGT ACAGACACAA TTACAATCAA TGGTGTAGAA 504 0 

TCATTACATG GTGTAGAACA TGCTATCATT CCAGATAGAA TTGAAGCAGG CACATTACTA 5100 

ATCGCTGGTG CTATAACGCG TGGTGATATT TTTGTACGTG GTGCAATCAA AGAACATATG 5160 

GCGAGTTTAG TCTATAAACT AGAAGAAATG GGCGTTGAAT TGGACTATCA AGAAGATGGT 5220 

ATTCGTGTAC GTGCTGAAGG GGAATTACAA CCTGTAGACA TCAAAACTCT ACCACATCCT 52 80 

GGATTCCCGA CTGATATGCA ATCACAAATG ATGGCATTGT TATTAACGGC AAATGGTCAT 534 0 

25 AAAGTCGTAA CCGAAACTGT TTTTGAAAAC CGTTTTATGC ATGTTGCAGA GTTCAAACGT 54 00 

ATGAATGCTA ATATCAATGT AGAAGGTCGT aGTGCTAAAC TTGAAGGTAA AAGTCAATTG 54 60 

CAAGGTGCAC AAGTTAAAGC GACTGATTTA AGaGCAGCAG CCGCCTTAAT TTTAGCTGGA 5520 

TTAGTTGCTG ATGGTAAnAC AAGCGTTACT GAATTAACGC ACCTAGATAG AGGCTATGTT 5580 

GACTTACACG GTAAATTGAA GCAATTAGGT GCAGACATTG AACGTATTAA CGATTAATTC 5640 

AGTAAATTAA TATAATGGAG GATTTCAACC ATGGAAACAA TTTTTGATTA TAACCAAATT 5700 

AAACAAATTA TACCTCACAG ACAGCCATTT TTATTAATTG ATAAAGTAGT TGAATATGAA 5760 

GAAGGTCAAC GTTGTGTGGC TATTAAACAA GTATCAGGAA ACGAACCATT CTTTCAAGGG 5820 

CATTTTCCTG AGTATGcGGT AATGCCAGGC GTATTAATTA CTGAAGCGTT AcTCAAACAG 5880 
GTGCGGTAGC TATTTTAAAT AGTGAAGAAA ATAAAGGTAA AATCGCTTTA TTTGCTGGTA 5940 
TTGATAAATG TCGTTTTAAA CGTCAAGTAG TACCTGGTGA TACTTTAACG TTGGAAGTAG 6000 
AAATCACTAA AATTAAAGGA CCAATAGGTA AAGGTAATGC TAAAGCTACT GTOGATGGTC 6060 
AACTTGCTTG TAGTTGTGAA CTTACATTTG CAATTCAAGA TGTAAAATAA AACAAAAAAA 6120 
ACATTCAAAG ATTTAATGTG TTGGCATAAT CTTTGAATGT TTTTTATTTT ACTCTTCTAA 6180 
SO TTTTTCATCC TTTAACTTTG GTTTAGACTG CaTCATTCGA TTAAATGATT TTTTTAATTC 624 0 

TTCACCAGAT AATCCATCAT CAATAAGTTG GTTCTAATAA ACTTTCAGCA TACTGTTGGA 6300 
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CAAGATATCA 
GAACGCGCAC 
ACTGGTCAAA 
TTAGATGGTA 
GATGTTATTG 
AATGAATACA 
TGTTGAACTC 
TCCAACTGTA 
ATATATTGCT 
TCAGACTGCA 
AGCAGAGTCT 
TTTAGAGAGA 
AAGATGTGAT 
ATGAGAGGGG 
ATCATTAAAC 
AAGAAATACT 
CCAAATACAA 
ACTTTTTTAC 
TCGACATTTC 
GATATTTATG 
TAAACTTTGT 
TGATAAAATG 
TATTTGTATT 
ATACGCCACT 
AAGCAATTTT 
AGTCTAACTC 
TTTAAAATAG 
ATAGTAATCA 
GCAGTATTAC 



TTGCTATCTT 
GTAGAATTCA 
AAGGTTCTTA 
AATATGACCA 
CAAAAGCTAA 
TTAAACCTAG 
GTTGTTATGC 
GCTGCTTTAA 
GTAAGCGATG 
GAAACTGCAA 
CACTTGGAAA 
GCAAATAACC 
TTCATATCTT 
GGAAGGAATT 
ACAGCTAATG 
AATTTCTAAA 
CACTATAGAG 
AATCACTAAT 
ATTTTTTGAA 
TAAAAAGAAT 
ATCATTATTT 
GATTATATCG 
GCCTATTGGG 
CAATTACAAG 
ATTGTAGATT 
TATGATTTGT 
GGAAATGTAA 
AAGGTGGAAA 
CAATATTGAC 



AGGTATGGAC 
ATTCTTCTTA 
TGTACCTGTT 
TATTCCAGTUX 
AGATATGGGT 
ATATTGTCAC 
AAACAACAGC 
AAACAGGCTT 
GCTTTGTTGA 
GAGAAATTGA 
ATGATGACGA 
GTTTGCGTGT 
CGACCCTTTT 
GATAAAGAAC 
TGTATTTT^AA 
GAAAAAGTAT 
TCTAGACATT 
AGATTGCTAA 
ATTATCTACA 
TATATAGTAA 
AAATTTTTCA 
GACAATATGC 
CTTTACAATC 
TGTGTATGAT 
TGTTACAATA 
AATCAAAACT 
AGTAATAGGA 
TAAATTAACG 
AGCATCTTTA 



GAATTATCTG 
TCTCAAAACT 
AAGACAACAG 
GATGCATTCC 
GTTGAAGTAT 
TCCTAATGGT 
TGGTGAGATA 
TGTAAAAGTG 
AGTTAGAAAA 
TGTTGAAAGA 
CAATACTGAT 
GGCTGAATTA 
TTGAATTATA 
CATTAAAGAT 
AATAGGaayA 
TTCTTTATGT 
GATTTATGTC 
AATCAAAATT 
TTTTTTCATA 
GTTAGCTTAA 
TGTACAATGT 
AGTTATCCAT 
AATTAGATTA 
ATTTGTTGCT 
CTCGACGCAG 
AGATATAATT 
GTTCTAAGTG 
GGTGAAGTTA 
TTAGCTTCTG 



ATGAAGATAA ACAAACAGTT 2820 

TCCACGTAGC GGAACAATTT 2880 

TTGCAAACTT TAAAGATATC 2940 

GTTTAGTTGG TAGCATGGAT 3000 

AACAATTAGG AGGAATGGAT 3060 

TCTGTTTACA ATCGTGATAA 3120 

GGTGTCATGA GTGGACATAT 3180 

AAATTTCACG ATGGAACTGA 3240 

GATAAAGTTT CAATCATTGT 3300 

GCTAAATTAG CCAAAGCAAG 3360 

ATTCATAGAG CCGAAAGAGC 34 20 

AAATAGTAAA TAAAGGGTCG 34 80 

TTGATTTAAA GATACAAAAC 3540 

TTATGATGTA GTGGTTCTTT 3600 

CATgAGTAAA ACTCATGTAT 3660 

TGGGGCCCCG TCAACTACTG 3720 

CGACTCCCAA GAATAGTTTT 3780 

TCCTTCACCA CTATCTACAG 384 0 

CCAAGATATT TTATAGTTAT 3900 

ACTTTACTAA AAACGGGTAT 3960 

AATACAGTAA TCTTATGAGG 4020 

TTAGTGTTAC ATGTTGTATG 4080 

GATCAATTTT TTAAAAAAGG 4140 

ATTTTATTAG GCACTGCAGT 4200 

GTAAAATATT TAATAAAATA 4260 

AAATAATGAC TTAAAATAAT 4320 

GAGGATTTAC GATGGATAAA 43 80 

AAGTAGAAGG TGCTAAAAAT 444 0 

ATAAACCGAG CAAATTAGTT 4500 
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ACTAGAAGTG GATATTTAGT TATCACGAGT GATAAAGGTT TAGCAGGTGC ATATAGTGCA 1020 

AACGTGCTTA AAAAATTGAT TACTGATATT GAAGCGAAAC ATCAAGATAG TAGCGAATAC 1080 

5 AGTATTGTAG TTTTAGGGCA ACAAGGTGTT GATTTCCTAA AAAATAGAGG TTATGACATT 1140 

GAGTATTCTC AAGTAGACGT ACCTGATCAA CCTTCTTTCA AATCTGTTCA AGCACTAGCT 1200 

AACCATGCTA TAGACTTATA CAGTGAAGAA GAAATTGATG AATTAAATAT ATACTATAGT 1260 

10 

CATTATGTCA GCGTTCTTGA AAACAAGCCT ACATCTAGAC AAGTATTACC ATTATCTCAA 1320 

GAGGATTCTA GTAAGGGGCA TGGTCATTTG TCTTCTTATG AATTTGAGCC AGATAAAGAA 13 80 

TCTATCTTAA GTGTAATCTT GCCTCAATAT GTTGAGAGTT TGATTTACX3G AACAATATTA 144 0 

15 

GACGCAAAAG CAAGTGAGCA TGCAACACGT ATGACTGCGA TGAAAAATGC CACTGATAAT 1500 

GCAACTGAAC TTATTGATGA CTTATCATTA GAATATAACA GAGCGAGACA AGCAGAAATT 1560 

ACGCAACAAA TTACTGAAAT TGTTGGTGGT TCCGCAGCGC TTGAATAATA TTTAAAGGAG 1620 

20 

GAAAATAGCA TGGGAATTGG CCGTGTAACT CAAGTTATGG GTCCTGTAAT TGATGTTCGA 16 BO 

TTTGAACATA ACGAAgTTCC TAAAATTAAT AACGCCTTGG TTATTGATGT GCCTAAAGAA 1740 

2s GAAGGTACAA TACAACTAAC ATTAGAAGTT GCGCTGCAAT TAGGTGACGA CGTTGTTCGT 1800 

ACAATTGCGA TGGATTCAAC TGATGGTGTC CAAAGAGGCA TGGATGTAAA AGATACAGGC 1860 

AAAGAAATTA GTGTACCTGT TGGTGACGAA ACATTAGGTC GTGTATTTAA TGTACTAGGT 1920 

30 GAAACAATTG ACCTTAAAGA AGA7VATTAGT GATTCTGTTC GCCGCGATCC TATCCATCGT 1980 

CAAGCACCAG CATTCGATGA ACTTTCAACA GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 204 0 

GTAGTAGATT TACTAGCACC TTATATTAAA GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

GGTGTAGGTA AAACAGTATT AATCCAAGAA TTAATTAACA ACATCGCTCA AGAGCACGGT 2160 

GGTATTTCTG TATTCGCCGG TGTAGGTGAA CGTACTCGTG AAGGTAACGA TTTATACTTC 2220 

GAAATGAGTG ACAGTGGTGT AATTAAGAAA ACAGCCATGG TATTCGGGCA AATGAATGAG 2280 

40 

CCACCTGGTG CACGTATGCG TGTTGCATTA TCTGGTTTAA CAATGGCTGA ATATTTCCX3T 234 0 

GACGAACAAG GTCAAGACGT ATTATTATTC ATCGATAACA TTTTCAGATT TACACAAGCT 2400 

GGTTCTGAGG TATCTGCATT ATTAGGTCGT ATGCCTTCTG CAGTAGGTTA CCAACCAACA 246 0 

45 

CTTGCTACTG AAATGGGACA ATTACAAGAA CGTATTACGT CTACAACAAA AGGATCAGTT 2 52 0 

ACTTCTATTC AAGCGGTATT CGTACCTGCC GATGACTATA CTGACCCAGC GCCTGCGACA 2 580 

so GCGTTTGCCC ATTTAGATGC AACTACAAAC TTAGAACGTA AATTAACTGA AATGGGTATT 264 0 

TATCCAGCCG TGGATCCATT AGCGTCTACA TCAAGAGCAT TGGAACCATC AATTGTAGGT 2700 
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TATTGTGTCA GTTTCAGACA ATCACGATAT TATTGCTAAT TTGCCGAATC AAACGtItcc 
CAAATTATCT AATTATGATG AAGTAAGGGA AATGAATCGC CAAAATCTCX3 ACGTTGAATC 
TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGrTCC AAGATACACC 
AGGTG7TGAT TCAAATGTTG CATCACATCA GTCAATAACA GAACAATATA TXJTATACAAG 
TAATATGATA TTTTATACGG 7TGACTATAA CCACGTTCAA TCTCAACTTA ACTTTAAGTT 
TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTX3TTT ATCATTAATC AAATTGACAG 
CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCXSAGTIXSG AAAAATCAAT 



1080 
1140 
1200 
1260 
1320 
1380 

1440 

TGGC 

IS 1444 
(2) INFORMATION FOR SEQ ID NO: 480: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



25 (^i) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 

GCAGCAGCTT ATCGTGAGTT ATCATTATTA TTACGTAGAC CTCCAGGTCG TGAAkCATAC 
CCAGGTGACG TATTCTACTT ACATAGTAGA tTATTAGAAA GAGCAGCAAA ATTAAACGAT 
GACTTAGGTG GCGGTTCAAT TACTGCATTA CCAATTATCG AAACACAAGC TGGTXSATATT 
TCAGCTTATG TACCAACAAA CGTTATTTCA ATTACAGATG GACAAATCIT CTTACAATCT 
GATTTATTCT TCTCAGGTGT AAGACCAGCG ATTAATGCCG GACAATCTGT ATCTCGTGTT 
GGTGGATCTO CACAAATTAA AGCAATCAAG AAAGTTCCTC GTACGTTACG TCTTGACTrA 
GCGTCATACA GAGAACTTGA ATCATTTGCA CAATTCGGTT CAGACCTTCA TGAATTTACT 
GCAAGTAAAT TAGAACGTGG TAAACGTACT GTTCAAGTCT TAAAACAAGA TCAAAACAAA 
CCATTACCAG TCGAACACCA AGTGTTGATT ATTTATGCAT TAACAAAAGG ATATTTAGAT 
GATATTCCAG TTGTAGATAT CACACGTTTT GAAGACGAGT TAAACCACTC GGCAGAATCA 
AATGCTACTG AACTGTTAAA TGAAATCAGA GAAACTGGTC GCTTACCAGA TGCTCAGAAG 
TTTGACACAG CAATTAACGA ATTCAAAAAA AGCTTTAGCA AATCTGAATA ATAAACAAGT 
TTAGTATAAG GTGGTGAGAT AGTCGCTTCT CTTAAAGAAA TAGATACTCG AATAAAATCA 
so ACCAAAAAAA TGAAGCAGAT TACGAAAGCG ATGAACATGG TATCAAGTTC AAAACTTCGT 

AGAGCTGAAA AAAATACAAA ACAATTCACA CCATATATGG ATAAAATGCA AGATCCAATT 
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GAATATATTT TTCGTTCGTT TCTGGATCAT GGTATCTGAA ATCTTCACCA GAGTGTTCAG 54 0 

CATGTTTACG TAAGTCGAAG TCTGTACXSAC TTGCGATACC CCATAACTCA CCCCAACCAA 600 

ATGGGAATTT ATATTCAATA TCAGTTGTTG CATTTGAGTA ATGAGATAAT TCATCTTCAT 66 0 

CATGATCACG TAAACGCATA TTTTCACTGC TCATATTTAA GCTTGTTAAC CAGTCACTTG 720 

CAAAAGTTTT CCAATAATTT TGCCATTCGA TTTCTTCTCC AGGTTTACAG AAGAATTCAA 780 

GTTCCATTTG TTcAAATTCT CTTGTTCTGa AAATGaAGTT ACCTGGAGTG aTTTCaTTAC 84 0 
GGaATGaTTT ACCAATTTGG ACCGG 
(2) INFORMATION FOR SEQ ID NO: 47 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 

TAATGAGTAA ATAAGTACCA GATAAAATCA TGAATATCAT CCAACATGCG GTTAACTCTA 60 

CTAAATAATT AATAATAGTA TTTTCAGTAA ACAAAGAACT ATGTATACTT CGCATCACAT 120 

TAGAATACGT ATGTTTCGCA TTTTGATCTG CAACAAATTG ATTGTTATGA TCTAGAAAGA 180 

CGTAACGTTG ATTTCCTGCC ATATCACTCA GTGTAATTCG TTTGTTATAT GGTTCATCAA 240 

GTATGCTAAC TTTACTTACA AAGAATCCTT CATATTGTTG TTCAACTTGA TGTACTGCAT 3 00 

CATTTAATGT TTGATGCGTT TTTACATCAC TGTCACCAAA AAACTCATTC TThTPAATAT 360 

35 TATTTTCAAC TTCTGGAAAG AACAAGTAAC CAATGCCCGA AATGGTTAAA GTGATTAACA 420 

GTGGAGCAAT AAATATTGCT GCATAGAAAT GTAATCTTTG TAATGGATTA AATGTATTTT 4 80 

TCATATTTCC CTCCCAATTG GCTATTATAC GGTGTCAATT CTGTGATGTG TGTGAACAAA 540 

CTGTGACAAT ATTTATTTTC TAGAAAAATT TAACGATGAT TTGTGATTTT TAGAAAAATG 600 

AACTTTTAAG TTGGAATGTT TGAAGAAAAT TGATTATTCG TATGTTTTAT CAAGCAGCTA 660 

TGATAAAATT TAAACATAAT ACAATGCGAG CCATTTAACG ATCTATGTTT AAATGGACAT 720 

CGATATTGTA TGAATTCGTT GTAACAAGCA AGCATTTCTA TGTGAACGAA CCAAAGGGGA 7 80 

AAGTAACATG ATTAATAAAG AACAATTAGA TCTTTTATAT AAATTAAAAA AAGAAGTTGA 84 0 

AAAGTCGCGA AATGAAGCAC TTTTACATAC AATTAACCAA GTAATTAAGA AAGTATATTT 900 

GCAGCAATAT ACATGTTCGT TCGTTGGACA TTTTTCTGCA GGTAAATCGA CACTGATAAA 960 
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ATTGAAAAGT ACGATGGTCA CATTCTAAGA TITGTGTCTG ATCAAGACGA 
GGTGTCCGCA ATCATTTATA TGAAACTGCT GGAGAAAAAA TAGTACTTAA 
GGCCATGCAA TGAGTGGTAT TTTAATGAGC AGAACACAGG CTATAATCTT 
AACAAAGTTA AAGGCTACCA AGACGAAAAT AATAAAGCAT TAAAATCCGT 
ACGAGGCATA GATTACATAA AGTAGAGACG TTAAGAGCGA ATTGGATTCA 
GGATCACTCT CTTCCTCCCw ACAACAATTA TTAGAAGCTT TAACAGCACT 
GAAGGCTTAA ATCAATTAGT GAATGAAGAA AGCCAACATT TGAAAAAATG 
TGGCACATAA ATTTGGAGAC AACTGGAAAA AAGCGCAAGA AGTTGGAAAT 
AAAAATTAAC CTCTGAAGAG GTTATAGATG rATTAAGAAA AGGTGGCGCG 
AACTTGAAAC AGATCCCAAA AGAAAAATTG ATGATAAGAT AAAGAAATTA 
ATAAAAATTG TAATGGCTAT ATCGCAAAAA TTAAACAGAG TATCGAAGCA 
ATGACCAAAT GTTAGCGAGC CAGATTGATC GGATGATGTA ATGTTTACTA 
TATTAATGAA CTTGAAAATG CCTATGATCA AGAAAGAAAA CAATTGAATG 
TCAAATTGAT GAATTAAGAC ATCAAACACG CAAGAmATGT GAACAAATCT 
CTTATATCTC AAACATAAAA TGAATTmyms TGAAGACGCT ATGATCAGGA 
TATAGAATCT TTCGATAGAG AAACGAATCA ACGTATCCGA CATCACGAAA 
(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



ATTAGATGCA 
AAATGGAGAA 
AGCTGAATTA 
TCGTAAACAA 
AACAACGGGT 
AACCATTGCC 
TATCACGCGA 
GAAATTGGTG 
TATGAAAGTa 
AATGATGTTT 
ATTGTTTCTA 
CGTATAAnAA 
ATGCATTCAA 
ATGATCATTT 
TGACACGTAT 
TG 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2332 



40 



4$ 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 
TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATATCGT 
TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT 
ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTCAT TATCTTCTAA 
TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT 
GAT-rOTGATT CATCGAAATC GATTGAGAAT TTAGAACTTA ATTGCTCAAA 
GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT 
GATGGAAGTG TAAAACTGTA CGTX3CATCTT TACTACCTTC AACGCCTTCT 



GGAAAGTCTT 
TGGCATACGA 
TGAATCAAAG 
TTTACCGATA 
AATCTTAATC 
GCTAATGCAG 
TCATCATATG 



60 
120 
180 
240 
300 
360 
420 
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CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC 
(2) INFORMATION FOR SEQ ID NO: 477: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 





GGTTATATGT 


TTGGTATTAC 


TCATTATATT GATTGGAGGT TGTGTCATTA 


TGACAAAAAC 


60 


75 


AAATGGTCGA AACGCTCAAA 


TTAAAGAAAA TTTCAACAAA ACATTAAGTG 


TATATCTAAC 


120 










TTGATAAAAG 


180 




CAAGAATCTC 


GATGAn-l-iT 


ACGATAAGGA AGGTTTTCGA GATCAAGAAT 


20 


AGATAAAGGG 


ACTTGGATTA 


TTTATTCTGA AATGGTTATC GAACCAAAAG 


GGAAnAATAT 


240 


GGAATCGAGA 


GGAATGGTGC 


TCTATATCAa TCGCAATACT AGAACmACGA 


AGGGTAATTT 


300 




TATTGTCACC 


GAAATAACTG 


AAGATAGTAA AGGATATTCA CGTAGTAAAG 


AAAAAAAATA 


360 


25 


TCCTGTCAAG 


ATGGAAAATA 


ATCGAATTAT TCCAACAAAG CCTATACCGG 


ATGACAAGTT 


420 




AAAAAAAGAG 


ATTGAAAACT 


TTAAGTTCTT TGTACAATAT GGAAATTTTA 


AAGATTTTAA 


480 




AGATTATAAA 


AATGGTGATA 


TTTCATATAA TCCTAATGTG CCAAGTTATT 


CTGCAAAGTA 


540 


30 


TCAATTGAAT 


AATGATGACT 


ATAATGTTCA ACAGTTAAGA AAACGATATC 


ATATTCCAAC 


600 




CAAACAAGCG 


CCCGAATTAA 


AATTGAAAGG ATCCGGCAAT TTAAAAGGCT 


CATCCGTAGG 


660 




ATCTAAGGAT 


CTAGAATTTA 


CGTTTGTAGA AAATCAAGAA GAGAATATCT 


ATTTTTCAGA 


720 


35 


TTCGGTCGAA 


TTTACACCTA 


GCGAGGATGA TAAATCATGA GTCAAACGGA 


ATATCAAATA 


780 




AAATCTGGCA 


ATATAAAAGG 


TAACTCTGAA GAAACAAGTA CAGTATCTAA 


TATAAGTTAT 


840 




GAAATAGAAA 


ACGCAAATAA 


CAGTGGTTTA AAACAAAATA AAATTGATAA 


ACAAATTAAA 


900 


40 


AAGTTACAAG 


AAAAAAATAA 


ATTCCCTAAA AATCTTTCAT ATCTTAAAAG 


TTATACGGAC 


960 




CCCAAAACAG 


GCACGACTAC 


AAGCGCCTTT TTAAATAAAG ACACTGGCAA 


AGTTACTTTA 


1020 


45 


GGTATGACAG 


GTACTAATGT 


ACACAAAGAC GCAATATTAA AACAAACATT 


TGGTGTTCCT 


1080 


TCTTATCAAG 


GATATATAGA 


CGTGAGTGAA ACgCTaAAAG ATATTGGGGC 


CGATGTCAAT 


1140 




ATTGGCCTTC 


ATTCCGTCAC 


AGATAAAGAT CCACATTATA AAAATACCCA 


AGACTTTATC 


1200 


50 


AAAAATATCA 


AA7VAAGACTA 


TGATATTGAT ATTATTACCG GACATTCGCT 


GGGCGGTAGA 


1260 




GATGCGATGA 


TTTTAGGTAT 


GAGTAATGAT ATTAAACATA TCGTTGTGTA TAATCCAGCT 


1320 
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GCGTTGGAAA AATTACGTCA CAATAAAGAG ACAGCATTtA GGctACGCAA ATAAATTTAC 
TCGAAAATTA CATCCGTATG PJKGGTM^Q^ GTATGAATTA ATCATAGATA TTTTGGATAA 
TGATGCTACC GAAGTGTACT TTGAATTACG TACATCTAAG ACTrCTTCAA CATTAArTGC 
TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCCGAC 
AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 
(2) INFORMATION FOR SEQ ID NO: 4 76: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1183 base pairs 

(B) TYPE: nucleic acid 

'5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 4 76: 
TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTCCGATGG ACCTTTTAAA 
ACnACCAAAT AGAAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 
25 AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGGTTTTTT AACTGAAAAT 

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 
AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 
30 PJ^OPJ^TTTT ACATGCTTTA GAAGAAAAAT TGAAACAAGG TCTAGAAGTA AAAATATTAT 

ATGATGATGT TGGATCTAAA AATGTTAAGA TGGCAAATTT TGATCATTTT AAATCGTTAG 
GTGGAGAAGT TGAAGCATTT TTTGCTTCAA AATTACCGTT ATTGAATTTC AGAATGAATA 
ATAGAAATCA TAGAAAAATC ATCGTAATCG ATGGTCAACT AGGTTATGTC GGAGGATTTA 
ACATTGGTGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 
GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 
AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGACCAT 
TGGGCAATTC ACCAATTCAA ATAGCTGCAA GTGGCCCGGC TAGTGACTGG CATCAAATTG 
AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 
ATTTCATTCC GGATAATTCA TATATaAATG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 
ATGTACATTT AATGATTCCA TGTAAGCCAG ATCATCCATT AGTATATTGG GCGACATTTT 
so CAAATGCCTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 



660 
720 
780 
840 
887 
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TGACCAAGTA CGTTTCGAAG TTGCCATTAA AGCATTAAAC CCATCATTGA AAGCATTCGC 60 

ACCTGTACGT GAGTGGGCAT GGAGTCGTGA AGAAGAAATC GATTATGCAA TTPJ^CA-thK 120 

TATCCCTGTA TCAATCAACC ATGATTCACC TTATTCTATC GATCAAAATC TATGGGGCAG 180 

AGCGAATGAA TGTGGTATTT TAGAAGATCC TTATGCTGCG CCACCAGAGG ATGCGTTCGA 24 0 

TCTAACTVAAT GCTTTAGAAG AAACACCAGA TACTGCTGAT GrAATCATTT TAACGTTTGA 300 

TAAAGGCATC CCAGTTCAAA TTGATGGCAA AACATATGAA TTAGACGATT TAATTTTAAC 360 

GTTGAATGCA TTAGCTGGTA AGCATGGTAT CGGAAGAATT GACCATGTAG AAAATAGACT 420 

TGTAGGTATC AAATCAAGAG AAATTTATGA GGCACCTGCT GCAGAAGTTA TTTTAAAAGC 480 

GCATAAAGCA TTAGAAACGA TTACGTTAAC GAAAGATGTC GCACACTTTA AACCAATCAT 54 0 

TGAGAAGCAA TTTGCTGAAC AACTATACAA TGGACTTTGG TTCTCACCTT TAACTGATAG 600 

CTTGAAATTA TTTATTGATA GTACTCAGCA ATACGTAAGT GGTGATGTCA GAATTAAATT 660 

ATTCAAAGGT AATGCCATCG TGAATGGTAG AAAATCACCT TACACATTAT ATGATGAAAA 720 

ATTAGCAACT TATACAAAAG AAGATGCATT TnATCAAGAC GCTGCTGTTG GCTTTATCGA 78 0 

TATCTATGGT TTACC "^^^ 
(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 60 

GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATGATGr GaAATATTAT 120 

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 180 

GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 24 0 

ATGTGGGAAT GCCCAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 300 

CAAGGTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 360 

GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 420 

GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTACTAATT 4 80 

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 54 0 
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CCtTCAACXSA ACTTTGCGTT ATCTCTTAAT AAATCGATAA CTTTTOSGAT ACGAACATCA 
TTTTTAATGA TATCAGTATT ACCTAAAGTA TTTTTCATAT CTTCAACTGA GATATTAAAT 
TGTTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC GATTm-TCA 
GCTTCAGCGA TCGCAGTTAA AGTrAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 
ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 
AAACCTTGTT GTTGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA 
ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAG CrrTTGTAAT CGCTTCTTCT 
TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGTTTACG TAAGTrTTCT 
TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAGCAATTTC ATCTGTTAAT 
TCTGGGACTT CTTTAAATTT AATTTCGTTA ACTTTTGnT TGAAAGTTGC TTCTTTACCG 
GCTAAITCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 
ACTTTCATAC CTTCTAATTG CTCTTCGAAA CCAGGTATCA ATGAACCTGA ACCGATTTCT 
AAATCGTAAC CTTCAGCTTG TCCACCTTCG AATTCTTCtC CGTCAACTGA ACCACTAAAG 
2S TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT CTTTAACGAC CATTTCAGCT 

AAATGTCCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATCAGATAA TTCAGITrCT 
TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 
GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTGAG TAACACTTAC TTCtGGTTCT 
GCAACTGGTT TAATATCAGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 
TCGATAGCAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGGC 
ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTITAA CCACTTTTTT GAATGCTTGA 
TCTAACGCTT TGTTTACTTT TTCTGCAGGA ACAGTAACAG TTAATAAACC TTCGTTACCT 
^ TCCTTTTTTT CCCAAGTTGC TGTCATCTAT ATATACCTCC ATGATTAACT AATrrATTTT 

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTGAAA TCACAACGTT 
TATATATTTG TAAATCAACT TTTTTCGTCA AAACTA 
^ (2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
so (D) TOPOLOGY: linear 
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AGGGTTCATT TCACCCAAAC CTTTGTAACG TTGTAACX3TG AaGCCTTTAC CAAGTTCTTT 5760 

TTGCAATTTA TTAAGCTCTT CGTCTGTCCA AGCGTATTCA ACTCGCTTTG TTTTGCCTTT 5820 

5 ACCTTTTTCC AATTTATAAA GTGGAGGTAA AGCAATAAAT ACACGACCTG CTTGAACAAG 5880 

CGGTTTCATA TATTTGAAGA AGAATGTTAA CAATAGCACT TGAATATGCG CACCATCAGT 5 940 

ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 6000 

ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCTTCATTTT TAAAAATATC 6060 

TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TCGCTTGGAA 6120 

TTTGCGGTCT CGTcCAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 6180 

TTCATTTTTT TCAGTGTTTT TACTTTGTGC AGGTGTTAAT TTACCAGATA GCAAAGTGTC 624 0 

TTTACGCTTG TTTTTCTTAC CTGAACGAGC ATCTTCACGA GCTTTACGTG CAGCTTCCCT 6300 

TGCTTGTTGT GCTTTAATCG CTTTTTTCAC AAGTGATTTA GACAATTGTC CTTTTTCTTC 63 60 

TAAATAGAAT GGCAATTTGT CTGCAACAAC TGAATCAACA GCACTTCTAG CTTCAGAAGT 6420 

ACCCAATTTA GATTTCGTTT GTCCTTCAAA TTGCAATAAT TCTTCTGGAA TACGAACAGA 64 80 

CACAACAGCT GTTAAACCTT CACGAATATC ATTACCATCT AAGTTTTTAT CriTTTGTTTT 6 54 0 

AAGTTCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG TTTTAAAACC 66 00 

AACTTCATGT GTACCACCAT CTTTAGTACG TACATTATTT ACAAAACTTA AAATACTTTC 6660 

TGAATATTGA TCATTATATT GGAAAGCTAC GTCTACCTCT ATACCATTTG CTTCACCTGA 6720 

AAATGTAGCC ACGTCATGCA AAACTTCTTT TCCTTCATTG ACATAACTAA CAAACTCTTT 6780 

GATTCCTTCT TATAATGGTA TGTCTT 6806 
(2) INFORMATION FOR SEQ ID NO: 473: 



20 



25 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1716 base pairs 
(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS t double 

(D> TOPOLOGY: linear 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

GGGGCAnAAA TTCCAATACA CTCATTACCA AATATATACA CCACTTCCTG CTACAAGTnn 60 

TTTTACTTGA TCTTGGTCTT TTCCGCAGAA AGAGCATTTC CAAATTTTCT TCATCTTCCA 120 

TTGAATTTAA ACATTCTTTT TACACCCCTA TTCGTTAAAG ACTATACTAG ATTGGATGTT 180 

ACAATGCAAC ATATTAACAT ACAAACTTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 24 0 
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TAAAGCTACG AATAGAAGTA CGTTTAATAT ATCCATGACG TGTCATACTT AAAATAACTy 


- 

3 960 




CTTCACTAGG CACCATAACT TCTTTGTCAA TTTTAATTTC TTCAATTTCT GCTTCAATTA 


4020 


5 


AAGACAGTCG TTCAGATTTG AATTTCTTTT TAATTTCATT CAATTCTTCT TTTATGACAT 


4 080 




TCAATAATGC ATCATGGTTA TCAAGAATAT GACGTAATTC TTTGATTAAT GCTrCAAGTT 


4 14 0 


10 


CrrTATGTTC ACCTTCAAGC GCAACTATGT CAGTATTTOT TAAACGATAT AACTGTAACA 


^ O rt 


TTACAATTGC TTCAGCCTGT TCTTCTGTGA ACTCGTATAC TTCGATAAGG TTTTCTTTAG 






CGTCACGCTT GTTTTTAGAG CwACGAATCA ATTCGATTAC TTTATCTAAA ATTGACAACG 


4 1 *5 A 


IS 


CTTTAATCAA ACCTTCAACG ATATGCATAC GTTTTTCTGC ATTATCTAAT TCAAACITCG 


^ J o u 




TTCTATTTGC AACAACCTCA ATTTGGTGAT TCAAATAACT ATCTATAATT TGACGAATAC 


A A ^ f\ 




CCATCAATTT TGGACGACCA TCACTAATAG CGACCATGTT GAAATTATAT GAAATCTGTA 


^ 3UU 


20 


AATCAGAGTT TTTATAAAGA TAATTTTTGA TTGATTCACT GTTCACATCT TTTTTCAATT 


A C ^ f\ 

4oo0 




CAATTGCTAT TCGTAAACCA GTTCTATCAG TTTCATCACG TACTTCAACG ATACCATCGA 


A c: n 




CTTTTTTGTC AGCACGTAAT TCATCGATAC GTTTTACTAA GCTACTTTTG ITCACTTCAT 


4680 


2$ 


ATGGAATTTC AGTAATAATT AACTGTTTAC GTCCATTGCX5 TAAAGTTTCT TCTTCAACTT 


4 74 0 




TAGAACGAAC TATAATTCTA CCTTTACCTG ATTCATAAGC TTTTTTAATA CCATCAATAC 


4 800 




CTTGAATAAT ACCACCAGTT GGAAAATCAG GACCTTTAAT ATATTTCATT AATTGATTGA 


4 8 6 0 


30 


CTGTAATATC CGGATTATCA ATATATTTAA GTGTTCCTTC AATCACTTCA GCTAAATTAT 






GTGGTGGTAT ATCTGTCGCG TAACCTGCAG ATATACCTGT AGAACCATTC ACTAGTAAGT 


A Q Q n 

4 y o U 


35 


TAGGAAATCT TGATGGCAAT ACCATTGGTT CGAGTGTCGT ATCATCATAG TTTGGAATGA 




AAGAAACTGT CTCTTTATTA ATATCACXJTA ATAACTCTTC AGCTAGTAAG CTTAACTTAG 


D X U U 




CTTCAGTGTA ACGCATTGCC GCTGGCGGAT CATTATCXSAT ACTACCATTA rTACCATGCA 


^ X o u 


40 


TTTCTATTAA GACATGTCGT AACTTCCAGT CTTGACTTAA ACGGACCATT GCTTCGTACA 






CTGAGGAGTC TCCATGTGGA TGATATTGAC CAATAACATC ACCGACTGTT TTCXSCACTTT 


5280 




TACGGAAATT TTTATCGTGT GTATTACCAC TTGAATACAT TGCATATAAA ATACGACGTT 


534 0 


4S 


GTACTGGTTT TAAACCATCA CGAACATCTG GCAATGCACG CTCTTGAATA ATATAITTAC 


54 00 




TATATCTTCC AAAGCGATCA CCTAAAACAT CTTCAAGTGA TAAATCTTGA ATTATTTnar- 


54 60 




TCACTAGATT TCCTCCTCAT CAAATTGATC ATTTTCAAGC ACTTGTACTT CAGAATTATC 


5520 


SO 


TAAAATACTT TGGTCCTCTT GCATACCAAA CTCAACATGC TTTTCAATCC ATTCACGTCT 


5580 




AGGTTGTACT TTGTCACCCA TTAATGTTGT TACACGTTTA GATGAACGCA CTTCATCTTC 


5640 
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GCAATATTAC TAAAACCATA CCAATGTAAA TGATAGCCAT AATCGGTACA ATTAACGAAG 2160 

ATAACGTAGC AATACTACGT ACACCACCAA ATATAATAAT AGCTGTTACG ATTGCTAAAA 2220 

5 TAATACCTGT GATTACTGGA CTAATATTAT ATTGCGTATT TAACGACTCC GCAATTGTAT 2280 

TAGATTGCAC TGTGTTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 2340 

TACCTAGCCA TTTTTGATTT AAACCTTTAG TAATATAGTA AGCTGGACCA CCACX5GAATC 2400 

CACCATCTTT ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 2460 

CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 2520 

TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTCT CGAACCAGCA CTAATCGCAA 2580 

ATGCTTGGAA TGGCGAAATA CCCTTCTTAC CATCTTCTAA AGTTTCTGGA CGTTCTACTA 264 0 

AAGCTCTAAA CATTTCAGGT AACATTCGTA ATTGAACGAA TTTAGAACTA ATCGTAAAGA 2700 

AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA CCATATTAAA TCGGTACCAA 2760 

CATGGACAAA TTCTTTAAAC CATCCAGGTA TTAAACTATC GAAATCTTTC AAAATAAACC 2820 

CCTCGCATCC TCTACATGAA TCATGTACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 2880 

GTAAATATAG AGATACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGTTTGGT 294 0 

CAAAGTATAT CGCTAATTTA ACGATAAGTA CTTGGTCAGC ATTTAATATA AATCCCTTGA 3 000 

ATTTAGTCAA AATTTAACAT TACTGTATTT TATCATTTAA TTTCGTGATT GCATATAGTT 3060 

TTTAGCTAAT ATACATGTCT ATTACTTCAC CAAAATCATC TGTATCTACA ATGAATGAGC 3120 

CATTTGTATA TTGTTCAGAT TTATGAATAT CATTAATTAA ACCATGTTCT TCATTTGATT 3180 

TTGAATATAA TGTATATTGA CTATGTTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 324 0 

GTGGATTTTT CTTTAATTCT TTTAATAAAG TTATTCCaCG TTGTGCTCTT TTAGC/VACTT 3300 

GTAAGATTTT AAAACTAATA CGTTTTAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 3360 

CATTTTCAGA AACACCTTCT GTCATAACAA CGAAATCTTC AGCTTTAAGA TTTATTGATT 3420 

TAACACCAGC TGCCCTTAAT CCGGTATCTG ATAGTTCACT TGTATTATAC GTTAATGACA 34 80 

TACCTTTATT AGTAATGACG GTAATTAATT GATCTTTTTC AAAGCGCATA ACACTAATCA 354 0 

AATCATCATT TTCTTTAACT TTAGTAGCAA TTAAAGGTTT ATTAAAACGC GTTGTTTTAA 3600 

ATAGAGGCAC TGTACTTTTC TTAATCATGC CATTTTGAGT CGCAAAAACA TAAAATGCAT 3660 

CTGTATTAAA GTCCTTTTCA TTAAAGACAT TAATAACCAC TTCATCTTCT TCGATAGGAA 3720 

CTATTTGTGA TACATGTTGT CCCAATTCTT TCCAACGAAT ATCTGCTAAT TTATGAACCG 3780 

GTATAAATAG ATAACGACCT TTATTTGTAA ATACTAGTAC GGTATCTTGC GTATTTACTT 3840 
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30 
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TTAATAACAT TATTnGATCA AGGTCATCTC ATCCATAGAT AAATCTTCTX3 TATTaGATGC ' 35 

AATATGTAAT GCAATAAATC CTATCTCATC TTCAGGaAAA TGTaCATCTA ATGCTGCATT 42 

TAACTGATCA ATCACCTGTT TAGCAATATG ATATGCATCA CTATATAACT GCATAGTTTC 43 

CATAACAAAT GGATTGCTAA TAACTTGATT TTCTTTTAAG CGTTTATAAG CAAATATAAT 54 

ATGATCCGTT AATGAAACTA CAAGTTGTTT TGAATCAACA TTCATCGCAG TATTAGAAAT 60 

AAAATTCAAC GAATCAATAA TTACTTGTAA TACATTATCA TCAGCGATTT CAACTAAACT 66 

TTTATAATGT GCTTTTTGTT GCTCACTTTC TAATTTATAA ATTTTCTCTA TTGTAATAGT 721 

TTGGTCGTTT AACGCCATTC CCTCTTTTTT GTTAAAACCA ATACCTTTAC CGATTAAAAT 78 ( 

AACTTCTTCA TCATTATTAG TACATACTAC GACATTGTTG TTCAATGTTT TAGTAACAAT 84 ( 

ATATTCTCCC ATTATCATCA CCTATTTTTC TATTATTAAG ATTATATATC GGAAATGTCT 90( 

2^ AACTTGAAAG TATAATAATT TAAATACTTG TGTTCTATTA CACGCCTATC CTATATGATA 96 ( 

TATCTTAATT TAATTTCGAT GTCTCTCAAA GTGGAATAAC TATAATAAAA AAATCTGACT 102C 

CCCATATTTA CAAATAATTC TATTTATTTA TATATTATCT GAATTAATAC TCAATACAAA 108 C 

25 CTAAAACGTA CTATTAAATT GTGCAAAGCT AAAACAAATT TATA-TTCATC TATCCAACAA 114c 

^ TATGTCTTAT CAATGGTATA GTCTTTGCAC ACCAATGGAG GAAATAAATC TCAACCTTAC 120C 

TATATTAATA TATAATCAAA TCTTAGATTA ACTAGTGTAA TCATACAGAT GATAATTGAG 12 60 

TACAAATTTA AAACCCTGAG ATTTTCGCTT TAATTTGAAA ACCTCAGGGT TTATTTGATT 1320 

TTTATATAAT GAATCGTTAC ATTPJKPJKATA TTTATTTATC AGAGTTCTTA TATTTGTTAG 1380 

CGCCCCAAGC ACTAATTCCA AATAAGTTAA TTTCTAAGTT TTCAGGTTTA AAGACAGGGT 144 0 

TCTTGCCTTC TTTTTTCTGC TTTTGATAAT CTTTCATCAA TGCAAAAGCT ACATTGGACA ISOO 

GTCCTATAAT GGAAATAATG TTTACAATTG CCATTAAGCC CATAAATAAG TCTGCCGTAT 1560 

TCCATACTGT TTCTGTTTTT ACAACTGCAC CGACAAAGAC AAGTACTACA ACAAGACATC 162 0 

TAAAGATAAA TAATATTACA CGGTTTGTTG ATAAAAATTC AATATTAGAT TGACCGTAAT 1680 

AGTAATTACC TACAACAGAT GAAAATGCAA ACAGTGTAAC tGCTaTTGTT AAGAAAATAC 1740 

45 CTCCAGCAGA ACCTAAATGC TCATTAAGTG CTGATTGAGT AACTGCAACA CCTTGAGGTC 1800 

CGTTATCACC AAATTTCAGT CCTGAATATA GTAAAATCAT GATTGCAGTT GCTGTACAAA 1B60 

CCAACATTGT ATCAAAGAAC ACACCTAATG ATTGGATTAA ACCTTGCTTA ACAGGGTGTG 1920 
SO GTACGGCAGC AGTTGCCGCT GCATTCGGCG CAGAACCCAT ACCAGCTTCG TTAGAGAATA 
AACCACGTTT GATACCTTGA AGAACCGCAG CACCTACAGC GCCACCAGTT ACTTGTTCGA 



35 



1980 
2040 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 



£ 



10 



20 



30 



X Xrxx XXX 




X X ^ X ^uvjr X x/^ 


TTTAACCATC 


GGTTTCAATC 


CTACGTGCTT 


60 


VbAL X X X w X a X 


\»L>AX XnnX XV>. 


nO XXX ^nVJV* X 




CTGTATAAAG 


GGTCAAAGTT 


120 




1 X V— f\ X 


X X\ X XW9\JA.V»^ii'% 


AC^TCGATAT 


CAATATTTTG 


AATACGAAAA 


leo 




X 1 1 X V..>1X V, 


PA AAAATAAA 


#\ X U%«^V3 X X X ^ 


GTACACGCAA 


ATGGTGTTTT 


240 




n X ^ X uV_Av7 1 Vj 


V. X X A X A X ^ Wra 


TAGAr*TGGCG 


CTATAACAAG 


CGTGAAAACA 


300 




rVXAX X>\l/\riX 


P A A r"*T A PT A 


APAATfSTiATA 


AATGGTTATC 


TAAAATCTCC 


360 




X X l—AAAA X 


A TT* A A A TfirVZ 


ATTAAAAfSTA 


CAAGCGCTAT 


GGGTATCCAA 


420 




W\\. X X X 


U-A X (.^V, X X X X 


X X A^jV-Vj XVj X o 


RmAPATAAA 


ATTATAATCC 


480 


CTCATCATTT 


TCACCTCTTA 


7\ A 'T*7\ r^r^ a ty^ a 


T^fip/^a aT*'|*T A 
X X X\_a/\X X Xr^ 


X X XwVvJ^XwX X 


^^-sJaVw X X \-. X X V- 


540 


GTATTTTAAG 


TTAATCGATT 


ijrviv^L. X WiU 


X X X Ai^\..x\ 


A P A A T* AJLAfir* 


TAAAATTATT 

X xax^xv^ X x,<^x X 


600 


TAAATTGTTG 


CGI X iAAOXA 


AiOXAXoX XVj 


PP A AnTl^TP A 


AATC CTATAA 


TGTGATGCGC 


660 


TTTAAAATAA 


TAAATATTTC 


GTTTCAATAG 


CTCGAAATTC 


TGGATAGTAA 


TTTGTTCTTC 


720 


TGTCATTTTA 


AAACCCGCAT 


GTTTGACATA 


AAGATATCCT 


TTGATCACAA 


ATAAACCAAT 


780 


AATGACTATT 


GTTATAATCG 


TAAATAACAA 


TAATAATTGA 


TTCCAAAAAT 


AACAGCCTAT 


840 


ACCTGCCATA 


GCTATGACAA 


TAATACTAGG 


TATTAAAAAG 


TGTCTGTGGA 


AACCTGACAA 


900 


AGGCATACCT 


TCATTAACTT 


GTTGATAAGA 


TAAATCTGGT 


ACTAAATTCT 


GGATAATTTG 


960 


ATATGCTTTG 


TCTCGTTTAA 


TAAACGGCaA 


TATCGGCACA 


CTACCTGAAT 


CATTGTC 


1017 



35 (2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 806 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

45 

TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 60 
GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 120 
TTATAGCAAA TCGGATAGTG ATTTTTAATC ATAGACACAA TATCATCTTG TGCATGTATA IBO 

SO 

TATTCTTTTC TTCTTAATCG GCG/VATTAAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 24 0 
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ATTATGAACT GAATGAAGCT AGGTACATCA CCTTTACAGT TTATAGAACT ACTCATAATA 
GTTTTGTTTT TGATTTATTG ATTTGTGAAA ACTTCATAAT TTATCATGGT GAAAAATACA 
CAATTAAGCA GACAGCGCCA AAGGTTGAAG GTGATAAAGT TTTTATTGAA GTTACGGCAT 
ATCACATAAT GTATGAATTT CAAAATCACT CAGTGGAATC AAATAAGCTT GATGACGACA 
GTAGCX3AAAC TGGTAAAACG CCAGAATACT CTTTAGATCA GTACTTAAGA TATGGATTTG 

10 

CAAATCAAAA AACGTCAGTC AAGATGACCT ATAAAATAAT TGGAGATTTT AAAAGAAAAA 

TACCAATTGA TGAATTAGGT AATAAAAATG GCTTAGAATA TTGTAAAGAA GCAGTAGATT 

TGTTTGGTTG TATTATTTAT CCAAATGATA CGGAGATATG OTTTTATTCT CCTGAAACAT 

TCTATCAAAG AAGCGAAAAA GTAATAAGGT ATCAATATAA TACTGATACT GTGTCTGCTA 

CTGTCAGTAC GTTGGAATTA AGAACAGCTA TAAAAGTTTT TGGGAAAAAG TACACAGCCG 

20 AGGAAAAGAA AAATTATAAT CCTATTAGAA CAACTGACAT TAAATATTCA AATGGTTTTA 

TAAAAGAAGG TACTTATCGT ACCGCAACAA TTGGGTCTAA AGCTACTATT AACTTTGATT 

GCAAGTATGG TAATGAAACA GTTAGATTTA CAATAAAaAA GGGCTCTCaA GGTGGAATAT 

ATAAGTTGAT TTTAGACGGC AAGCaAATTA AGCaAATTTC TTGTTTTGCT AAGTCGGTTC 

AGTCTGAmAC AATAGATTTA ATAaAAAATA TTGATAAAGG CAAGCACGTT TTAGAAATGA 

TATTTTTrGG AGArGrCCCC AAAAATAGAA TTGATATATC TTCAAATAAA AAAGCTAAGC 
30 

CTTGTATGTA TGTTGGAACT GAAAAATC7VA CAGTCTTAAA TTTAATTGCT GACAACTCAG 
GTCGCAATCA ATACAAAGCA ATTGTTGaCT ACGTCGCAGA TAGTGCAAAG CAGTTTGGGA 
TTCGATATGC TAATACGCAA ACAAATGAAG ATATCGAAAC ACAGGATAAG CTGTTAGAAT 
TTGCAAAAAA GCAAATAAAT GATACTCCTA AGACTGAATT AGATGTTAAT TATATAGGTT 
ATGAAAAAAT AGAGCCAAGA GATAGCGTAT TCTTTGTTCA TGAATTAATG GGATATAACA 
40 CTGAATTAAA GGTTGTTAAA CTTGATAGGT CACATCCATT TGTAAACGCA ATAGATCAAG 

TGTCTTTCAG CAATGAAATA AAGGATATGG TACAAATTCA ACAAGCGCTT AACAGACGAG 
TTATTGCACA AGATAATAGA TATAACTATC AAGCAAATCG TATAAATCAT TTATACACTA 
GTACTTTGAA TTCTCCTTTC GAGACAATGG ATATAGGGAG TGTATTAATA TAATGGCAAC 
AGAAGAAGTT AAAATCAAAG CGCTACTTGA AAACGATAAA C 
(2) INFORMATION FOR SEQ ID NO: 471: 



50 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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ATCAGGTGCT ACAGGTAATT TCGTTAGAGG AGCACACTTA CATTTCCAAT TGATGCAAGG 6 54 0 

GTCGCATCCA GGGAATGATA CAGCTAAAGA TCCAGAAAAA TGGTTGAAGT CACTTAAAGG 6600 

TAGTX^GCGTT CGAAGTGGTT CAGGTGTTAA TAAGGCTGCA TCTGCTTGGG CAGGCGATAT 6660 

ACGTCGTGCA GCAAAACGAA TGGGTGTTAA TGTTACTTCG GCTGACGTAG GAAATATCAT 6720 

TAGCTTGATT CAACACGAAT CAGGAGGAAA TGCAGGTATA ACTCAATCTA GTTCGCTTAG 6780 

AGACATCAAC GTTTTAC7VGG GCAATCCAGC AAAAGGATTG CTTCAATATA TCCCACAAAC 684 0 

ATTTAGACAT TATGCTGTTA GAGGTCACAA CAATATATAT AGTGGTTACG ATCAGTTATT 6900 

AGCGTTCTTT AACAACAGAT ATTGGCGCTC ACAGTTTAAC CCAAGAGGTG GTTGGTCTCC 6960 

AAGTCGTCCA AGAAGATATG CGAATGGTGG TTTGATTACA AAGCATCAAC TTGCTGAAGT 7020 

GGGTGAAGGA GATAAACAGG AGATGGTTAT CCCTTTAACT AGACGTAAAC GAGCAATTCA 7080 

ATTAACTGAA CAGGTTATGC GCATCATCX3G TATGGATGGC AAGCCAAATA ACATCACTGT 7140 

AAATAATGAT ACTTCTACAG TTGAAAAATT GTTGAAACAA ATTGTTATGT TAAGTGATAA 7200 

AGGAAATAAA TTAACAGATG CGTTGATTCA AACTGTTTCT TCTCAGGATA ATAACTTAGG 7260 

25 TTCTAATGAT GCAATTAGAG GTTTAGAAAA AATATTGTCA AAACAAAGTG GGCATAGAGC 73 20 

AAATCCAAAT AATTATATGG GAGGTTTGAC TAATTAATGC AATCTTTTGT AAAAATCATA 73 80 

GATGGTTACA AGGAAGAAGT AATAACAGAT TTTAATCAGC TTATATTTTT AGATGCAAGG 744 0 

GCTGAAAGTC CAAACACCAA TGATAACAGT GTAACTATTA ACGGAGTAGA TGGTATTTTA 7 500 

CCGGGCGCAA TTAGTTTTGC GCCTTTTTCA TTAGTATTAA GGTTTGGCTA TGATGGTATA 7560 

GATGTTATAG ATTTAAATTT ATTTGAGCAT TGGTTTAGAT CTGTGTTTAA TCGCAGACAT 7620 

CCTTATTATG TTATTACTTC TCAAATGCCT GGTGTTAAAT ATGCAGTGAA TACAGCTAAT 7680 

GTTACATCTA ATTTAAAAGA TGGTTCTTCA ACTGAAATTG AAGTAAGTTT AAATGTTTAT 7740 

AAAGGGTATT CTGAATCAGT TAATTGGACC GATAGCGAGT TCTTATTCGA CTCTAATTGG 7800 

ATGTTTGAAA ATGGAATTCC TCTTGATTTC ACACCTAAAT ATACTCATAC ATCAAATCAA 7860 

TTTACTATTT GGAACGGTTC TACTGATACG ATAAATCCAC GATTCAAGCA CGATTTGAAA 7 920 

ATATTAATTA ATTTAAATGC GAGTGGAGGA TTTGAACTGG TTAATTATAC AACAGGTGAT 7 980 

ATTTTTAAGT ACAACAAAAG TATAGATAAA AACACTGATT TTGTTTTAGA TGGTGTGTAT 8040 

GCATATCGAG ATATAAACAG AGTGGGAATT GATACAAATA GAGGCATTAT AACATTAGCG 8100 

CCAGGTAAAA ATGAATTTAA GATTaAAGGA GACGTCAGTG ATATTAAAAC TACATTTAAG 8160 

TTTCCTTTTA TTTATAGGTA GGTGATTTAA TGGATTATCA TGATCATTTA TCAGTAATGG 8220 

55 



30 
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30 



35 



40 



45 



SO 



GTTGGTGGTC 
AAGAAGAAAC 
ACGGCGTAAA 
TTGGCGGCGA 
CAAGTTGGAG 
CTGGTAAATG 
ATCAAACTAA 
TTTGGAGAGG 
TAGATATGGC 
ATGCAAAATC 
AAGG'ITGGAC 
CGGCATGGTC 
ATGATTGGAT 
ATGTTGCTAA 
CTAAAGCCAT 
TAGCAGGAAA 
TATTAAATGA 
GGGCTGACGG 
GAGATAGTGT 
AATTCCATGG 
AAAAAGCAGG 
CAGAAGAAAT 
ATAAAATCGG 
TGTCAGGTTT 
CGCGTACTCA 
TGGTGGCGGA 
TTACACAGGT 
TACAGGAACG 
CGGTGGCGGT 



TAACTTTAGA 
AGCTCGTAGA 
AACTAAAACT 
AATGAAAAAA 
CTCAGCCAAA 
GTTTGGAAAA 
GCAAAAGTAT 
CACATCAAAA 
TAATAAATCG 
CGTTTGGAAA 
TGGGGATATG 
TAACGCTAAA 
TAGAGATATT 
TAAAGCTATT 
TACTGATAAA 
GGGTGTAGCT 
TAGAGGTTCT 
AACATTCCAT 
AATAAATGCC 
TGGTACGAAA 
GGACTTCGGT 
GGTTGAAGCG 
CGATGTGTGG 
AAATATTAAT 
TTGCTCT^AAA 
GGCGATGGAA 
GGACTTAACT 
AACATTTATG 
AATTCTATAC 



GAAGACCAAA 
AACAGAGAAA 
GGTGAAGCCT 
ATGTGGAGTG 
AGTTCTGTAG 
GCTTGGCAAT 
TCAGATGCCT 
TGGTTTAGCA 
CGCTCGAAAT 
GGAACATCGA 
TATTCAAGAG 
TCAGTATTTA 
GGTAAAGACA 
GGCGGTTTGA 
AATCTCATCA 
ACCGATAATT 
GGAAACGCCC 
GCACCCCAAG 
AATGACACTC 
AAGAAAAAAT 
TCTAAAGCTA 
GCAGGCGATA 
GATTATGTAC 
TTTGGAGGCG 
AGAAATTAGT 
GCTATCTATT 
TTAATGGCGG 
CTGTTAAAGG 
AAATTAAGAC 



AGAAGAAAAG 
ATATAAAGAA 
TTAGTAAAAT 
GAATCAAAGG 
GATATCACAC 
CTGTTAAATC 
CAGATAAAGC 
ACGCATATAA 
GGGATAATAT 
AATGGTTTAG 
CCCACGATCG 
ATGGTTTTAG 
TGGGAAGAGC 
ATAGCATGAT 
AGCCAATACC 
CAGGAGCATT 
CAGGTGGTGG 
GACGAGATGT 
TGAAGTTACA 
GGATGGAACA 
AAAACACAGC 
AAATCAAAGA 
AACATCCAGG 
GACTAACGCT 
AGACAAAGTA 
TGACCATCCA 
TCGTCACTAT 
CGGTATAGCT 
CGGTGCTAAC 



TGATAAATAC 
ATGGTTTGGA 
GGGCAGAAAT 
AATTCCAAGC 
TAAGGCTATA 
GACTACAGGA 
TTGGGCGCAT 
AAGTGCAAAG 
TTCTAGTACA 
TAACTCATAC 
TTTTGATGCA 
AAAATGGCTA 
TGCGGCTGAT 
TGGCX5GTATT 
TACATTGTCT 
AACGCAACCG 
AGTTCAAGAA 
GGTTGTTCCA 
GCGGATGGGT 
AGTTACTGAA 
TCATAATATC 
TGGTGCATCT 
GAAACTAGTA 
ACAGTAAAAA 
AAATCGTGGT 
ATTTGGCA7UV 
GGTATCGACT 
GATAAAGTAT 
GAATGGAACT 



GCTAAAGAAC 
AATGCTTGGG 
GCTAATCATT 
AAATTAAGTT 
GCTAATAGTA 
AGTATTTACA 
TCAAAATCTA 
GGTTGGCTAA 
GCATGGTCGA 
AAATCTTTAA 
ATTTCAAGTT 
TCAAAAACAT 
TTAGGTAAAA 
AATAAAATAT 
ACTGGTACTT 
ACATTTGCTG 
ATAATTCACA 
CTAGGAGTTG 
GTTTTGCCAA 
AATCTTGGTA 
AAAAAAGGTG 
TGGTTAGGCG 
AATAAAGTAA 
TTGCTAAAGG 
TTGAAGATTT 
GGTTTGGGAG 
TTGGTATGCC 
GGACTGATTA 
GGTATATGCA 



4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 ^ 

5580 

564 0* ' 

5700 

5760 

5820 ' 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 
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CAGATAAAAT TAATAGCTAT AGCAAATCAT TGAAGAACTC TAATGGTGAA AGTAAAAAAG 2940 

CAGCTGATTT GATGAAAGAT AACCTCAAAG GTGCTCTGGA ACAATTAGGT GGCGCTTTTG 3000 

AATCGTTAGC AATTGAAGTT GGTAAAGATT TAACGCCTAT GATTAGAGCA GGTGCGGAAG 3060 

GATTAACAAA ATTAGTTGAT GGATTTACAC ATCTTCCTGG TTGGGTTAGA AAGGCTTCGG 3120 

TAGGCTTAGC AATTTTTGGT GCATCTATTG GTCCTGCTGT TCTTGCTGGT GGCTTATTAA 3180 

TACGTGCAGT TGGGAGCGCG GCTAAAGGCT ATGCATCATT AAATAGACGC ATTGCTGAAA 3240 

ATACAATTCT TTCTAATACC AATTCAAAAG CAATGAAATC TTTAGGTCTT CAAACATTAT 3 300 

TTCTTGGTTC TACAACAGGA AAAACGTCAA AAGGCTTTAA AGGATTAGCC GGAGCTATGT 3360 

TGTTTAATTT AAAACCTATA AATGTTTTGA AAAATTCTGC AAAGCTAGCA ATTTTACCXST 3420 

TCAAACTTTT GAAAAACGGT TTAGGATTAG CCGCAAAATC CTTATTTGCA GTAAGTGGAG 34 80 

GCGCAAGATT TGCTGGTGTA GCCTTAAAGT TTTTAACAGG ACCTATAGGT GCTACAATAA 3540 

CTGCTATTAC AATTGCATAT AAAGTTTTTA AAACCGCATA TGATCGTGTG GAATGGTTCA 360 0 

GAAACGGTAT TAACGGTTTA GGAGAAACTA TAAAGTTTTT TGGTGGCAAA ATTATTGGCG 366 0 

25 GTGCTGTTAG GAAGCTAGGA GAGTTTAAAA ATTATCTTGG AAGTATAGGC AAAAGCTTCA 3720 

AAGAAAAGTT TTCAAAGGAT ATGAAAGATG GTTATAAATC TTTGAGTGAC GATGACCTTC 3780 

TGAAAGTAGG AGTCAACAAG TTTAAAGGAT TTATGCAAAC CATGGGCACA GCTTCTAAAA 384 0 

AAGCATCTGA TACTGTAAAA GTGTTGGGGA AAGGTGTTTC AAAAGAAACA GAAAAAGCTT 390 0 
TAGAAAAATA CGTACACTAT TCTGAAGAGA ACAACAGAAT CATGGAAAAA GTACGTTTAA 396 0 

ACTCGGGTCA AATAACAGAA GACAAAGCAA AAAAACTTTT GAAAATTGAA GCGGATTTAT 4020 
CTAATAACCT TATAGCTGAA ATAGAAAAAA GAAATAAAAA GGAACTCGAA AAAACTCAAG 4 080 

AACTTATTGA TAAGTATAGT GCATTCGATG AACAAGAAAA GCAAAACATT TTAACTAGAA 414 0 

CTAAAGAAAA AAATGACTTG CGAATTAAAA AAGAGCAAGA ACTCAATCAG AAAATCAAAG 4200 
AATTGAAAGA AAAAGCTTTA AGTGATGGTC AGATTTCAGA AAATGAAAGA AAAGAAATTG 4260 
AAAAGCTTGA AAATGAAAGA CGTGACATCA CTGTTAAAGA ATTGAGTAAG ACTGAAAAAG 4320 
AGCAAGAGCG TATTTTAGTA AGAATGCAAA GAAACAGAAA TGCTTATTCA ATAGACGAAG 4 380 

CGAGCAAAGC AATTAAAGAA GCAGAAAAAG CAAGAAAAGC AAGAAAAAAA GAAGTGGATA 4440 
AGCAGTATGA AGATGATGTC ATTGCTATAA AAAATAACGT CAACCTTTCT AAGTCTGAAA 4 500 

AAGATAAATT GTTAGCTATT GCTGATCAAA GACATAAGGA TGAAGTAAGA AAGGCAAAAT 4 560 

CTAAAAAAGA TGCTGTAGTA GACGTTGTTA AAAAGCAAAA TAAAGATATT GATAAAGAAA 4620 
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AAGAAAAACA lAGrGPJKQM^ PJ^TCKTTA CAGGTACGGA TTTPJsGPJKPJ^ (rrTTTTGGPJ^ 
GCTAGAAAGG AGGTTAATAT GAATGAAAAA GTAGAAGGCA TGACCTTGGA GCTGAAATTA 
GACCATTTAG GTGTCCAAGA AGGCATGAAA GGTTTAAAGC GACAATTAGG TGTTGTTAAT 
AGTGAAATGA AAGCTAATCT GTCAGCATTT GATAAGTCTG AAAAATCAAT GGAAAAATAT 
CAGGCGAGAA TTAAGGGGTT AAATGATAGG CTTAAAGTTC AAAAAAAGAT GTATTCTCAA 
GTAGAAGATG AGCTTAAACA AGTTAACGCT AATTACC7VAA AAGCTAAATC CAGTGTAAAA 
GATGTTGAGA AAGCATATTT AAAGTTAGTA GAAGCCAATA AAAAAGAAAA ATTAGCTCTT 
GATAAATCTA AAGAAGCCTT AAAATCATCG AATACAGAAC TTAAAAAAGC TGAAAATCAA 
TATAAACGTA CAAATCAACG TAAACAAGAT GCGTATCAAA AACTTAAACA GTTGAGAGAT 
GCAGAACAAA AGCTTaAGAA TAGTAACCAA GCTACTACTG CACAACTAAA AAGAGCAAGT 
GACGCrtACA GAAGCAGTCC GCTAAGCATA AAGCACTTGT TGAACAATAT AAACAAGAAG 
GCAATCAAGT TCAA7WVCTA AAAGTGCAAA ATGACAATCT TTCAAAATCA AATGATAAAA 
TTGAAAGTTC TTACGCTAAA ACTAATACTA AATTAAAGCA AACAGAAAAA GAATTTAATG 
ATTTAAACAA TACTATTAAG AATCATAGCG CTAATGTCGC AAAAGCTGAA ACAGCTGTTA 
ATAAAGAAAA AGCTGCTTTA AATAATTTGG AGCGTTCAAT AGATAAAGCT TCATCCGAAA 
TGAAGACTTT TAACAAAGAA CAAATGATAG CTCAAAGTCA TTTCGGTAAA CTTGCAAGTC 
AAGCGGATGT CATGTCAAAG AAATTTAGTT CTATTGGAGA CAAAATGACT TCCCTGGGAC 
GTACAATGAC GATGGGCGTA TCTACACCGA TTACTTTAGG TTTAGGTGCA GCATTAAAAA 
CGAGTGCAGA CTTTGAAGGG CAAATGTCTC GAGTTGGAGC GATTGCACAA GCAAGCAGTA 
AAGACTTAAA AAGCATGTCT AATCAAGCGG TTGACTTAGG AGCTAAAACA AGTAAAAGTG 
CTAACGaAGT TGCTAAAGGT ATGGAAGAAT TGGCAGCTTT AGGCTTTAAT GCCAAACAAA 
CAATGGAGGC TATGCC7VGGT GTTATCAGCG CAgcAGaAGC AAGTGGTGCA GAAATGGCTA 24 00 

CAACTGCAAC TGTAATGGCT TCAGCX5ATTA ACTCTTTCGG TTTAAAAGCA TCTGATGCAA 2460 
ATCATGTTGC TGATTTACTT GCGAGATCAG CAAATGATAG TGCTGCAGAT ATTCAATATA 2520 
^5 TGGGAGATGC ATTAAAATAT GCAGGTACTC CAGCAAAAGC ATTAGGAGTT TCAATAGAGG 

ACACTTCTGC AGCAATTGAA GTTTTATCTA ACTCAGGTTT AGAGGGGTCT CAAGCAGGTA 
CTGCATTAAG AGCTTCGTTT ATTAGGCTAG CTAATCCAAG TAAAAGTACA GCTAAGGAAA 2700 
50 TGAAAAAATT AGGTATTCAT TTGTCTGATG CTAAAGGTGA GTTTGTTGGA ATGGGCGAAT 2760 

TGATTAGACA GTTCCAAGAT AACATGAAAG GCATGACGAG AGAACAAAAA TTAGCAACAG 
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TGTGCTATAC CATTAACAAT GTAATACATT GGATTTAGCA TTAGGATGTG ATTGATAAAT 3900 

ACATGATTTG GATTTGGTAT GAAAATAATT GGTAACAATA AGAAACACAA TACACAAACC 3960 

CCATAAAATA TGATATTTAT TTTTTCAGwT AACAGTCGAA TAAGACCAAA AGTAACGGAT 4020 

ATTAATCCTA CAAAAATAGT TGCCATCACA ATAAAATAGA ATAGCGCTAT ATATGATGTT 4080 

TCXSAAGTTTA CTGGTTTAAC CAATGCACTA ATCT^TCGTCA AAATGACTAG CATAATAAAA 4140 

CTTAAAATAG ACATAATAAC TACTGGCGTC G 4171 

(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 
'5 (A) LENGTH: 9821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

TGGTTGAAGT AGCAGTTAAT TCAAAATCTG CAACAGTTTC AGCAGAATaG GGGCTTTCAA 60 

2S AATAAtCAAA GGAGAATAAT TTATGACTAA AACTTTAAAG GTTTATAAAG GAGACGACGT 120 

CGTAGCTTCT GAACAAGGTG AAGGCAAAGT GTCAGTAACT TTATCTAATT TAGAAGCGGA 180 

TACAACTTAT CCAAAAGGTA CTTACCAAGT GGCATGGGAA GAAAATGGTA AAGAATCTAG 24 0 

30 TAAAGTTGAT GTACCTCAAT TCAAAACCAA TCCAATTCTA GTCTCAGGCG TATCATTTAC 300 

ACCAGAAACT AAATCAATTA TGGTAAATAC CGATGACAAT GTTGAGCCAA ACATTGCACC 360 

AAGCACAGCA ACGAATAAAA TATTGAAATA TACAAGTGAA CATCCAGAAT TTGTTACTGT 420 

AGATGAAAAT ACAGGAGCAA TTCACGGTGT AGCTGAAGGT ACTTCAGTAA TCACTGCTAC 480 

GTCTACTGAT GGAAGCGATA AGTCAGGACA AATTTCAGTG ACAGTAACAA ACGGATAGGG 540 

ATTTAAGGCG CAGTATATCT GCGTCTTTTT TATTTGAATA AAAGGAGCTA ATACAATGAT 600 

TAAATTTGAA ATTAAAGATC GTAAAACAGG AAAAACAGAG AGCTATACT^ AAGAAGATGT 660 

AACAATGGGC GAACAGAAAA ATGCTATGAG TATTTAGAAT TAGTAAATCA AGAGAATAAA 720 

AAAGAAGCAC CTAACGCAAC AAAAATGAGA CAAAAAGAGC GACAGTTATT AGTAGATTTA 780 

TTTAAAGATG AAGGATTGAC TGAAGAAGAT GTTCTGAACA AGATGAGTAC TAAAACTTAT 840 

ACAAAAGCCT TACAAGATAT ATTTCGAGAA ATCAATGGTG AAGATGAAGA AGATTCAGAA 900 

ACTGAACCAG AAGAGATGGG AAAGACAGAA GAACAATCTC AATAAAAGAC ATTTTATCGA 960 

ACATTAAGAA AATACAACGT TTCTGTATGG AGCAGTATGG GTGGACATTA ACTGAAGTCA 1020 
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GCTTGaCTCA AAACTTCCAC AGTTGGAAAT CGTTCAACAA AACGATGATA ATAGTCAATA 


2100 




ACTGTTTTAA CTTGTGTCTG TTGTAACATG ACCTCACTTA ACCAAATATA GTACGGATTG 


2160 


5 


GTCGTTTGTC GCCATGGCAT TTCTCTTTCA TTTTCATCAA ACCAGTGTAT CAAATTTTCT 


2220 




TTAAAACTAG ACTGCTGATA CATTTATAAA ACCCTTTCCT CACCAAAAIT AATrGTCTTT 


2280 


10 


ACTCATAATG li'T'rrATTGT ACATTAAAAT CATGGTTAGT ATGTAAGTTA ATTTAGrTAT 


2340 




TTGCGAAATT GGATTATAAT AGTATATATA ATATTATGAA ATGAGTGAAC TGATATGGAC 


2400 




ACTGCAACAC ATATCGCAAT TGGGGTGGGC CTTACAGCAC TTGCAACTCA AGATCCAGCA 


2460 


IS 


ATGGCTTCTA CGTTTGGTGC AACAGCTACA ACCCTTATCG TTGGTTCATT AATTCCTGAT 


2520 




GGGGATACTG TTCTTAAATT AAAGGACAAT GCAACATATA TTTCGCATCA TAGAGGTATC 


2580 




ACGCATTCCA TCCCTTTCAC AATACTATGG CCAATTTTAA TTACATrTTT AATATTCACG 


2640 


20 


TTCTTTAGTG GAACCAACCC ATTTCATGTA TGGATGTGGG CTCAGCTCGC AGTATTTTTA 


2700 




CATGTCTTTG TAGATATATT CAATTCTTAT GGTACACAAG CGCTTAGACC TATCACAAAC 


2760 




AAATGGATTC AATTAAGTGT GATTAACACA TTTCACCCTA TTATTTTCAC AGTTCTTTGT 


2820 


25 


ATTGGTATTG TATTATGGGT TATAGGCTTG CATCCArTTG CAGTCTTCTT TCCTATAATC 


2680 




GCTTTACTAA TCATTTATTA CATGATTCXST TTTAAAATGA GAGCCGTAAT TAAGCAACAA 


2940 


30 


GCTTTAAAAG CAATTCAACA AGAGCATCAC CCTGTTAAAG TATTTGTTGC GCCAACAATA 


3000 


AAATTTATGG AATGGCGTGT CGCGATACAA ACTGATCCAC ATGACTATGT TGGAAAAGCA 


3060 




TATGGTAGGA ATGTGGTGTT TAGTGATAAA GTGGAACGTC AAACATTATC AACAGACTCC 


3120 


35 


ATTTTATGGA AAGTCAAAGG TAATAAAGAT ATACGTACTT TTTTAAACTr TTCATCAATC 


3180 


TATCGTTGGC AAACAACAAC GTTAGCAGAT GGTTCTACTG AAATTCGTTr GATTGATTTG 


3240 




CXSTTATTTAA AAAATGATCA TTATTCATTT GTGGCAATTG CACATGTAAC AAACGATAAT 


3300 


40 


GTCATAGACC ACTCTTATAT TGGCTGGGTA TTTACAGAAG ATAAGTTACA ACGTAAACTG 


3360 




TATGCTAAAT AATTTCAAGT TATTATTCAC TAAAGTTAAT CTATAAAAAA TGAACAACCG 


3420 




GGCAGAATGA AAATCAAAAC GATTTTTACT CTGTCCGGTT TTTTAATGTA AAACTATGAA 


3480 


45 


TGCrrriACA aaatctaaaa tttatattgt tgctaacaaa ctacctttaa tgactcgaaa 


3540 




TATCAAAATC AGTATAGGAA AACAATATCT AGATGATATT CTAATTGTTT CTGATTCTCA 


3600 




CAGATTAATT TACACAACAG GTCAGCTAAA CATCATGAAG AAGTATCCGC CTCGTCTGTA 


3660 


SO 


CTATCATTTG AAACATCCTG TTGATTATCa GTTTGTGTCA CTTTAGATGT TTTATTaTAA 


3720 




attgcgtgtg tcgtatacct tgctaatacg aaatttactg cagctattaa acataagaat 


3780 
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TTATATTCTA TTCAATTTAA TCTATGGATA CTGTGTCCCC ACACGACAGC AAAAGTTATC 300 

ATACTTCTTT ACATCACTAA GTCAATATAA ATGATTTAAT CAGTATTTAC ACTTTATTTG 360 

CTTAATACTG TCTAATTTTT TTGTAACGTT CTTTCCAAAC TTTGATAAAA TCTGGCGCGA 420 

ATGGGCCCTT CTTCTGTTCT ATCCATTGTT GAAGAATGTC CACGTTGCGT CTTAAAATAA 4 80 

TATCAATATC ATGCGGATAA TTCATTTGAT TCATATGTTG CTCATATTCA TCTTCATCTA 540 

ATAAATGATA CTTTCCGTTT GGATATACTT TAATATCTAA ATCATAGTCT ATATATTTTA 600 

ATGCCTCTTC ATCACAAACA AATGGTGATG ACAAATTGCA ATAGTAATAA ATTCCATCTT 660 

CTCTAAACAT GCAGATAACA TTaAACCAAT ATTCTGAGTG AAAGTaAACA ATTGCCGGTT 720 

CACGTGTTAT CCAAGTTCTT CCGTCACTTT CAGTCACTAA CGTATGATCA TTTCCACCAA 780 

TGACAACATG ATCAGTACCC TTTAATATTG TTGTTTCAGA CCAAACGCXSA TGAATCTTAC 84 0 

CATCATGTTT ATAACTCTGA ATTTTAATGT TTTCCCCTTC TTTAGGTATG GATTCTCTGA 900 

CCATACTCCA CACCACCTTC TGTTAATTTA ACCATTATAA ATTATAGCAT ATTTCAGAAA 960 

TAGTATTATA TAAATACATA TTTTTACGAA ATAAGATTTT ACTACTTAAT AATTAAACTC 1020 

25 GGTAATATTG CTAAGTACTA CAACAGAGAT TTACATGTCC CATTTAAAGT ATATAAAATC 1080 

ATCACTTTTA TATATCAACA CTTTAACTTT TTGACATTGT TATTCTATGA GATTTAAAGA 114 0 

TATCATTTAT ACTTTTTAAA ATTAATGTCA CTATGTTTTC CGATAATATT ACCAATCATC 1200 

GAATGTTACC CATTTATAAA TTGATAAATs TTTGACATAG GTACAGGGAA TGTATATTGA 1260 

TCTCGATCAC TTAAATCAAA CCAAATCATG TCATCTGGTA ATGTTTCAAT GTTAATTGCT 1320 

CCTGAAACGG CGTATACTTT AATCTTCCAT GTTAAATGAG TAAATTGATG CTTCAACTCA 1380 

AAAATAGGTG TTTCTACTGG TTGAATGTCA TGACCGATTT TTTCAGTCAT TTTACGTCTA 1440 

GCATGCTCAC TTTCAAACAT AGGAAATTGC CACATACCAT GCAATAATTT TTCGCTACGC 1500 

TTTTGCAACA GATATTGACC TTGATTATTT CTAATTAAAA AGACGGATTG CTCAATTACT 1560 

TTTTTACTTA CATTTTTAGA TTTAACAGGT AACTTTTCAA ATGTACCTTT ATCAAATGCC 1620 

TCACAGTTTT CTTGAACTGG ACAAAATAAG CATAATGGAT TTTTTGGTGT ACAAATTAAC 1680 

GCCCCTAATT CCATCATAGC TTGATTAAAC GTTCCAGCTT CTGTAGTAAC ATACGGTAAT 1740 

AATTCTTGTT CGTACGATTT CCTCGTCGAT TGTAATTTAA TATCTCGATA GTCATCATTC IB 00 

AATCTAGACC ATACTCGAAA AACATTTCCG TCTACAGTTG CTAGTGGTAC ATTATATGCA 1860 

ATGCTCATTA CTGCAGCTTG TGTGTATGGG CCAACACCTT TTAACGCTTT AAATTGATCA 1920 

GGATCTTTGG GAACTAAGCC TTCATATTTA TCATGAACTT CTTTAATCGC CX3TATGAAAA 19 BO 



40 



45 



SO 



55 



SfXXaO: <EP 0786519A2J_> 



1385 



EP 0 786 519 A2 



5 



IS 



20 



25 



30 



35 





TTAAAACTAA GAGAAGCCAT 


CAATTATGAT 


GAAATGGTTA 


TTGTAGATAG 


240 


Til TY^ R r^r^'V 7\ 


TTCCTAATTA TGCkAAAAGG 


GATTGATGAA 


AAACTGAAGG 


GCTTTTCATC 


300 


AAluUI*! 1"! i' 


ATTTTAGGGG AATTGAATAG 


ATAGTTTTAA 


ACTATACGAA 


TTATTAATAT 


360 


J. lV»AotM.i 1 lA. 


ATTGAAATAA GTTTTAAAAA 


TTGGAGGAGA 


TAGATTAAGC 


GAAGTCATTT 


420 


& & R A J* 

AAAVj^j 1 AAG 


TTAAGTGTAT TCACAAAAAn 


TAGCCACACT 


CATATGACAT 


CGGATGAGTG 


480 


1 \ t. n. AAGG 


ATCTATGGGG GGAGGAAnCC 


ATAGATGTTT 


ACTTTGATAG 


GCCAGATTAA 


540 


A 1 A 1 1— AAAGT 


ATGCGATTAT TTATAGCTTG 


ATGCAAAAGT 


GGTATGCCTA 


TTAAAAGTTA 


600 


C X vjIJaCATaG 


C i i iTAATAT TCCGTTCAAA 


GGAAAGGGGC 


ATACAATTGA 


ACAATCTGTA 


660 


ATAGTACTTT 


TAACCAGCTA TGCTAAAAGT 


CTAGTAGGGA 


GAACAGTTGT 


CCAATCACAT 


720 


AAGAACCTCT 


AACTTCGTTA GTACGATTAA 


GAAAAGCTTT 


TTAGTTAGTA 


TGTAATACAA 


780 


TTTATTGACG 


CGCGTGAATC TCTTTTATAA 


GAGTGTGTAG 


GGAATGGCGT 


TGTATAAATT 


840 


GTATTAGAAG 


AACTTCTAAC GCATCTCTGT 


GGTTAAAAGA 


GATGAAGGGA 


ACGACAGTTT 


900 


aATTAAAACT 


GCATAAGAAC TTCTAGCTTT 


TCTCTCTCGT 


TCAAAGAGAA 


GCAGcTGTTC 


960 


G CAGTTT AAT 


CAAAACCACA TAAAGCTTTT 


AACTTTACTC 


TTTGATTTAA 


AGAGTGATAA 


1020 


ATG TTTACAG 


TTTAATTAAA ACTGCATAAG 


AACTTCTAGC 


TTTTCTCTTT 


CGTTCAAgAG 


1080 


A.^^^^A ^«^^n^^m 

AAGCAG CTG T 


TCGCAgTTTA ATCAAAACCA 


CATAAAGCTT 


TTAACTTTAC 


TCTTTGATTT 


1140 


AAAGAGTGAC 


AAATGTTTAC AGTTTAATTA 


AAACTGCATA 


AGAACTTCTA 


GCTTTTCTCT 


1200 


TTCGTTCAAA 


GAGAAGTTCT AATACCACCA 


TATCGTGCGA 


TCGGGAACGG 


TATATATATT 


1260 


AATAGGAGGG 


TAATATATAT TTAACGCACG 


ATATGGGACT 


ATTAGCCTTC 


GACTTTGTTA 


1320 


TGTTGATGTG 


TGGCCTAAAA TATTGGAGAT 


ACCAATATTT 


TAGGTTGCAT 


CAACATCA 


1378 


(2) INFORMATION FOR SEQ ID NO: 46 9: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4171 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 69: 
TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 
SO TCGTTTAATC ATACATCCCC CCTAATCTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 

AATAGAGGTT GGATACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 
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ATTTTAGTTT TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAG 600 

TCGAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA GGGTTGTTTG €60 

GAAACTTTGT ATCACTATCA TTTAATACGA CTACAATTTC AGTTGGGGCT AGTGGTGCTA 720 

TATrrGGTCT GATTGGATCA ATTTTTGCGA TGATGTATGT TTCAAAAACA TTTAACAAAA 780 

AAATGTTAGG ACAGTTATTA ATTGCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 84 0 

CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 900 

TTGGCTATTA CTATAAAGTG AATCGtAATA TTTTTTGGaT TTaCtAATTG GTATGCtTGt 96 0 

tATATTTAwT GCACyTCmAA TTAgAtTTTT ACmATTAAAG AAGATAATAw TTATAA 1016 
(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 06 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 60 

ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTTCTTT 120 

CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 180 

TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TTAGCATGGC 24 0 

TAAATTGTTG AAGACCCTTA CACGATAAAC GTATAGCGTC TGAATTCTCA TTTAACAATG 300 

CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA ACtTCGTTAC AATCaTCGTG 360 

kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 406 
(2) INFORMATION FOR SEQ ID NO: 468: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 46 8: 
SO AGATAATACA CTTGAAGTTG GAATGGTTTG TGACGGTTAT TTAATGCGAA TTGAAAACTT 60 

AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATACG ATTACTAAAA TTAAATTAAA 120 
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TATTCGATTT 


GAnTCTTTTA 


AAATTATGTT 


TAAAAACATG 


TCTAATGATT 


CCGCATCATT 


120 




TTTTGCACCA 


TCAATAAGCG 


TTTCAGCAAA 


CCCCTTAATT GAAGTAATAG 


GTGTTTTTAA 


180 


5 


TTCATGTGAA ACATTTGCTA 


CAAATTCACG 


TCTTAGATTT 


TCAAGTTGTT 


TCAGATTTGT 


240 




TATATCATGC 


A 1 L-Hl_aACTA 


AAATCCCtTG 


CAAAC'i-i"rrr 


TGAGACCTAG 


TTAAAATCGG 


300 


10 


AACG CATR A A 


*\if%l v-AAAGT 


ACTTGGCATG 


GACTTGGTTT 


ATTGCAACTT 


CCAATTGTTC 


J b U 


AT AAATAGG T 


1 1 1 i UAACTT 


TAAAACTTTC 


TAAAATTAAT 


TGCTCAATTT 


CAGTATTAAC 


420 




ATAGCC<5TT;a 


i iiVjs- 1 ACTT 


GTTCAATATT 


ATGCGAGATG 


TTGAACTGTT 


CATAATACGC 


480 


15 


TTTATTTGCA 


ACAACGATTT 
GT'ii-x-rAAAC 


TTCCATTTCG 
GGTTGGATTG 


ATCTATCATT 


AAAATAGCAC 


TTGGAATATT 


540 






AATTlTriGC 


TCATTATTAA 


GCTTTTGAAG 


600 




GCGTCGTGCT 


AAATCATTGG 


TAGACACAAA 


AAGCGCTTTA 


GTTTCTACAA 


CATTACTTTC 


660 


20 


AGGTACACGT 


ATGTGATAAT 


AACCATTTGC 


CAACAATTGT 


GTTGCATAAG 


TAACTTCTTG 


720 




AATGGGACGG 


ATTAATGTAC 


GCTTAAAACT 


ACGGCTTGCA 


AAATACAGAC 


AAATGAGTAC 


780 




AACTAAACAT 


GTCAAAATAA 


GATATTTCCA 


CAACGTCCAA 


TGCATTTCTG 


TAATATCGTT 


840 


25 


ATTGTAACCT 
CCTTT 


TTAATCCATA 


CATGATAACC 


GTTAACCTTC 


TTATtAAAAA 


TAAAAACGTC 


900 
905 



(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 
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TTTGGTTAGC 


CCATAAAAGA AAAAAACAAG 


TAGTCATTTT 


TAAACAACAT 


ATCAAGTCTA 


60 


CCCAAGAAAT 


ACGTTTTGAC 


AAAGCGAAAG 


TGCTTGAACA 


CAAAGATGAA 


ATAGCAAATT 


120 


TTATTTCTTT 


CGAACCACAA 


AGTTTTGAAT 


TTTATTATTT 


TACAGAATCG 


GAATTTTCAG 


180 


AAGAACAATT 


AAATGAAGTT 


TCGCCAATTA 


GAATTAAATT 


CAATGTTATA 


AGACACACAA 


240 


AAGATTTGaT 


AAAGCATATG 


CCGAATATAT 


TTTTGGcTAG 


ACTTATTTCA 


GAAGATAATG 


300 


ATAAAAAGAC 


ATATATGTTC 


TATAAACGCA 


AAGTATTAAC 


CGATAACTTT 


TTAGATAAAT 


360 


ATATGCAGAA 


ATTTTCACCG 


GCAACATACA 


CAATAATATT 


TGTAAATGTC 


TTAATATGGT 


420 


TATGTATGAT 


TTTATATTTA 


AATAATTTTT 


CGGATGTAAA 


ATTATTAGAT 


GTTGGCGGGT 


480 
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ATTTAACACA CATAATGAAT CATTTGATAA ATCATTTATC GCTTCTAGCA CATCGTAATA 2100 

TGCAGTGTCA TTATTTTTTT GTATTGTGAG ACAATCATCC AGTTCTTCTA TTACATAACT 2160 

TCTATATTCA TCATAAATTT TCATAATAAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 2220 

ATTACTkTAT ATCTATTACT AAtkCATTCC CGTATTTATT AATTACAATC ATAGTTTGGC 2280 

TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GTATTAATAT TTCATGCAAA TGGGGGACAG 234 0 

GAGTCX5CCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 24 00 
AATAGCCGAC 

(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 65: 



2410 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 
25 TTTATTAATT 6TAAAAAATT GAGTAAATTA TCTTTACATT CTAAATTAGT CTTAACTACA 60 

ACTAGTATCC TAATAATTAT AGGAGCTATT ACATTCTTTT TATTAGAACA GTTTAATACT 12 0 

ATGCAACATA TGGGACTAGT TGAAAAAATC GGAAATTCTT TTTTCCAATC AGTAACAACA 18 0 

CGAACAGCGG GTTTTAACAG TATAGATATA GCAAGCATTA rCAAATCTAC CGCATtAaTG 24 0 

TTAATGCTAC TTATGTTTAT TGGTGGTGCC CCTCTCAGTG CAGCTGGAGG AATTAAAATA 3 00 

ACTACTTTTG CAGTTGCGTT T A TTTTTGTA CTAAATTATA kACGTAAAGA AAATAATGTT 360 

TCAGTATTCA ATAAAGAAAT ATCTGACAAA CATATAAAAC TATCTATTGT TACCATTAAT 420 

ATCTCATTTC TATTTATCAG CATCATTACT TTTATATTAT CGATAATTAA TCCGAACATA 480 

TCATTAATCA AGTTATTATT CGAAGTGGTT TCTGCATTCG GAACAGTAGG GTTAAGTATG 540 

AACCTTACCA CAGAATATCA TGGTATTACT AAAATAATTA TTATATTCGT 590 
(2) INFORMATION FOR SEQ ID NO: 465: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : doubl e 
(D) TOPOLOGY: linear 



55 



nooaO: <EP 0786519A2_I_> 



1381 



EP 0 786 519 A2 



2S 



30 



SO 



GCATCTTCAA CTTCAAAAGG CCCAATCGTA TAGCCTGAAC TAATAATAAT GTCATCTCGA 
CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTCATCTAG 
TATTCACCTG TTTGCGCTTT CGCCGTACX3T tCTCGCTCTT tATAATACCC TTTCAAAAGT 
GCTGGCAAAT CAAGTGGTAC TGCAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 



300 
360 
420 
480 

TC7VTCATCTA CTACAGTGAC CGAACTACCC GGAATGCCrTT TACCCATTGA TCCAATCCTC 540 

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTCGCC ATATCCATCT 600 

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTCGCTCA 660 

CCTGCAGAAA CGGCACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720 

GCCATCATAC GATATTCTGT CXW3TGTACAA CATAAAACAT TAATTTGATA TTTTTGAAGC 780 

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTCAA 84 0 

20 CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGc TGCTGTrGCC 900 

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 960 

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTCC CAGTTGTACC AGATCTATAT 1020 

GACAGAATCG CCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTCCGTTT ' 1080 

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CGGCAATAAC AAATTTAGTT 114 0 

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 1200^ 

GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 126 0 

CATGGAATGA TTGCAACACC TAATTTTAAA GCAGCAATAT ATAATTCATA CGTCGCAATA 1320 

GATCGTGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TCCTAAAACA 1380 

TTACCTACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 144 0 

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 1500 

GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACITCCTT 1560 

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620 
TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 
4S ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGGG 
CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACGAT TAATTATTTT 
TACATCTGCA CCTTTCGAAG TGCCACGATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 
TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 
AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATGCTT 



1680 
1740 
1800 
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1980 
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AGnTCAATAT TTAGATCAAG CCGTTTTAAG TAATTACGAA CAAGTTTATA TCATTCATGG 120 

TAAAGGTACA GGTGCACTTC AAAAAGGTGT ACAACAACAT TTGAAAAAGC ATAAAAGTGT 180 

5 TAGTGACTTT AGAGGTGGTA TGCCAAGCGA AGGTGGATTT GGCGTTACCX3 TTGCAACACT 24 0 

AAAATAAATT ATAATTTGAT AAATTAAATA GCTGCAGTTA AAATAATGTA AAGCAACAAG 300 

AATACATTTC AAACATGTTA TTTGAAATAA GCATAAAAAT TGAGCAAATA GAAATACATG 360 

AAGCATGTTA TCTGATATAA TTTGAACATC ATAATAATAA TTAAGGAGGA TTGGCATTTA 4 20 

TGGCAATCGT AAAAGTAACA GATGCAGATT TTGATTCAAA AGTAGAATCT GGTGTACAAC 480 

TAGTAGATTT TTGGGCAACA TGGTGTGGTC CATGTAAAAT GATCX5CTCCG GTATTAGAAG 540 

AATTAGCAGC TGACTATGAA GGTAAAGCTG ACATTTTAAA ATTAGATGTT GATGAAAATC 600 

CATCAACTGC AGCTAAATAT GAAGTGATGA GTATTCCAAC ATTAATCGTC TTTAAAGACG 660 

GTCAACCAGT TGATAAAGTT GTTGGTTTCC AACCAAAAGA AAACTTAGCT GAAGTTTTAG 720 

ATAAACATTT ATAAGTTACA ACCAATGACG ACTGGGGCAT TTCTTTAATG AATTGCTCCA 780 
GTTTTTGTTT GTGTTTTTAA TATAAAAAGT TGAATGATAA GTCATCATAT TGTTTACGAC 840 
TTGAGAATGG TGGGATTAAT AAATCTATGA ACGTTAAATG ATAATCTAGC ATGCTGATAG 900 
ATTTGTAGCA GTTGGTTTGA TAAAACCATG TTCAATATTA CATGATGTGC ATGAAAAGTC 960 

ATACTCGAAG ATGTTGATTA TTAACTAGAA TTAGTGGTGA TAAATTTGAA GCACTTTTGT 1020 

AGCATCATTC ATTTTAAAAT TAGAAGGGGG GATATTTTTG GAAGACTATA AGCAACGAAT 1080 

TAAAAATAAA TTAAATGTCG TACCTATGGA ACCAGGATGC TATTTAATGA AAGATCGTAA 1140 

TGATCAAGTG ATATATGTTG GCAAAGCTAA AAAGCTAAGA AATCGATTGC GATCATATTT 1200 

35 1209 
TCACGGGTG 

(2) INFOimATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

AGTTCAACAC GACGAATTTT ACCTGAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 60 

TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 120 

AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 180 



tNSOOCIO: <EP 07e6519A2_L> 



1379 



10 



25 



30 



SO 



SB 



EP 0 786 519 A2 

AAGGCITTAA AGAATATCAT GAAGCTAAAG GATCACAATT GCCGAAAATG nTGGCTTCC 
AAGCTGAAGG CGCATCACCA ATTGTTCAAA ATAAAGTCAT TAAAAATCCT GAAACGATTG 
CAACTGCTAT TCGAATTGGT AATCCTGCTA GTTCGGATAA GGCGACTAAT GCTCTTAAAG 
AATCAAATGG ATTAATAGAT AGTGTTACTG ATCATGAAAT TCTAGAAGCA TATCAGTTAA 
TGACAACTAA AGAAGGTGTC TTTAGTGAAC CAGCGAGTAA TGCTTCTATT GCAGGTTTAA 
TTAAATTGCA TAGACAAGGT AAATTACCTC AAGGTAAAAA AGTAGTTGCT ATTTTAACTG 
GTAATGGATT AAAAGATCCT GATACTGCTA TTTCACTACT AGATAATCCG ATAAAGCCAT 
TGCCAAATGA TAAAGATAGC ATTATCGATT ATATTAAAGG AGCTTTATAA CATGTCGAAT 2280 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 



1860 
1920 
1980 
2040 
2100 
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GTTTTGGAGT TAACAATTCC TGCATCAACA GCCAACCTTG GAGTTGGCTT TCATTCTATA 234 ( 

OGTATGGCTT TAGATAAATT TTTGCATCTG TCTCTAAAGG AAACATCAGG GACAAAATGG 24 OC 

20 GAATATATTT TCCATGATGA TGCATCTAAG CAATTGCCTA CTGACGAAAC AAACTTTATT 24 6 C 

TATCATGTAG CACAACAAGT TGCTTCTAAA TATAGTGTTG ACTTGCCTAA TTTATGTATC 2S2C 

GAAATGAGAA GTGATATTCC ATTGGCAAGA GGGTTAGGTT CGTCAGCTTC TGCTTTAGTA 25 8 C 

GGAGCTATAT ATATCGCAAA TTATTTTGGT GATATCCAAC TGTCTAAACA TGAGGTATTA 2640 

CAATTAGCGA CTGAAATCGA AGGACATCCT GATAATGTTG CGCCGACCAT TTATGGTGGT 2700 

TTAATCGCTG GATATTATAA TGATGTCTCG AAAGAAACGT CaGTtGCACA TATCGACATA 2760 

CCAGACGTGG ATGTGATTGT AACGATACCA ACTTATGAAC TAAAAACAGA AGCATCAAGA 2820 

CGTGCTTTAC CACAAAAATT AACACATAGT GAAGCGGTTA AAAGTAGTGC AATTAGTAAT 2880 

^ ACAATGATTT tGgCATTAGC ACAGCACAAT TATGAATTAG CAGGTAAACT CATGCAACAA 2940 

GATGGCTTTC ATGAACCGTA TCGTCAGCAT TTAATTGCTG AATTTGATGA AGTCAAAACA 3000 

ATTGCTAGTC AACATAATGC CTATGCAACT GTAATTAOTC GTGCTGGACC AACTATTTTA 3060 

ATATTTAGTC GTAAAGAAAA TAGTGGGGAA TTGGTTCGCT CTTTAAATAG TCAGGTAGTA 3120 
TCATGCCATT CTGAA 

(2) INFORMATION FOR SEQ ID NO: 4 62: 

•»S (i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GACAGCAAAT 
ATAAATCTAA 
AAATTTTAAA 
CTGTAGATTG 
ATTTATTAGC 
TAAAGTTTGA 
TGAATGCGAA 
TATCTAAAAT 
TTGGTTTTGC 
TTGTCATTAC 
GAGGAATTAG 
TTAAATTGAT 
CGTTAATTGA 
TTATTGGTGA 
CAGGTAGTGC 
ACACATTGCC 
ATGCAGAAAT 
GTTCAATTGA 
GAGTTGCAAA 
CGGAAGAATT 
TTAATTATAA 
AATGAAAATA 
AATATGTCTA 
GTTCATTTAA 
AAATTGTAAT 
GAGCAGGTTT 
CGCAAGCAGT 
TAGAAATTGT 
CATTTAGAAT 



TCAAGATACA 
AAAGAGACCG 
TGATGATTCA 
GTTAAGAACA 
AGTACATCTT 
AGCGAGTGTA 
TAATATTTCA 
GACTAAAGAG 
TGAAGCGGAT 
ATCATATTTA 
TGGCGTAACT 
TGGTAAGGGA 
TAAAAAGCAT 
TGCCGTTGGT 
CGTTGTCAGT 
ACCACATTTT 
TAATATTAAA 
AAATTTTGAA 
TTACGATAAT 
AATCACTAAG 
TGAGAAGATG 
CACCAAAATT 
AAATACTAGG 
AGATCGCGGT 
ATGCGCTTCG 
AAAAGCTATC 
AATGTATGGT 
AAAAGAAATT 
CGAAGGACAA 



TTAAATAAAG 
CTAAATATTA 
TTAGATATTA 
GCACTTAAAA 
AAACTTTTAG 
GCAGGTGGTA 
AAATTTATGG 
CAAACGACAT 
CCAACTGATG 
TCATTTAACC 
TTAACTGATA 
ATATATGAAA 
CAATTAGCAG 
GACACGATGT 
GATTTATTGA 
GAATTAAAGA 
GAAAAATCCA 
AATGAGTTAA 
CAATCATATG 
CATGGATACG 
GCAAGGATTA 
AACATTGAAC 
CATAGATTTA 
ATGGTAATGG 
ACTGGAAATA 
GTCGTAATAC 
GCAGAAATCG 
GCAAAAAGTG 
AAGACAGGCT 



ATATTGTCAT AAAGCATATT CTTGTTCGAG 60 

GCCAATATCA TTTAACTGAA GATGTTAATG 120 

TCGTTGAAGT CATGGGAGGA ATTGAACCAA 180 

ATAAAAAACA TGTTATTACC GCAAATAAAG 24 0 

AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300 

TTCCGATCGT AAATGCCATA AATAATGGTT 360 

GAATTTTAAA TGGTACCTCT AATTTTATTT 420 

TTGAGGAAGC ACTTGATGAA GCGAAAAGAC 4 80 

ATGTAGAAGG GGTAGATGCA GCGCGTAAAG 540 

AAGTCATTAA ATTAAACGAC GTTAAACGAA 600 

TTAATGTAGC CGATCAACTG GGGTATAAAA 660 

ATGGCAAAGT TAATGCATCG GTAGAACCAA 720 

CTGTAGAGGA TGAATATAAC GCGATTTATG 780 

TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 84 0 

ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900 

CAGATAAAAC ACGGGAAATG ATGGATTCAG 960 

ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020 

AGGCAATATT ACCATTTCAC CGATCATTAA 1080 

CCGCTGTTAT AGTTGGATTG GAATCATCAC 1140 

AATTGACAAA GTATACCCAG TAGAAGGAGT 1200 

GTAGAAGAGT TTAAAGCACA TTTACCAGTA 1260 

GAGGGAAATA CACCACTCAT TCATTGTGAA 1320 

TATGTGaAGT ATGAAGGTGC CAATCCGACA 1380 

CTGTGACAAA AGCAAAAG;^ CAAGGTAAGA 144 0 

CATCAGCGTC TGCAGCAGCA TATGCAGCGA 1500 

CAGAAGGTAA AATTGCATTA GGTAAATTGT 156 0 

TTTCTATTGA AGGAAACTTT GATGAAGCTT 162 0 

GCGAAATCGA GCTTGTAAAC TCTGTCAATC 16 80 

CATTTGT^AAT TGTACAACAA TTAGACGGTG 174 0 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 
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ATCTGCAATT 


ATGGGCACAC 


CCAAGCTnAT 


GCAAGTAACT 


AAAGGAGAAG 


TACTTTTAGA 


60 


CGGTGTAAAT 


ATTTTAGAaT 


TAGAAGTTGA 


TGAAAGAGCA 


AAAGCAGGAT 


TATTCTTGGC 


120 


AATGCAATAT 


CCATCAGAAA 


TTACAGGTGT 


TACAAATGCT 


GATTTCATGC 


GTTCAGCAAT 


180 


CAATGCGAAA 


CGTGAAGAAG 


GACAAGAAAT 


CAACTTAATG 


CAATTTATTA 


AGAAATTAGA 


240 


TAAAAACATG 


GATTTTCTAG 


ACATAGATAA 


AGACATGGCA 


CAACGTTATT 


TAAATGAAGG 


300 


TTTCTCAGGT 


GGAGAGAAGA 


AACGTAACGA 


AATCTTACAA 


TTAATGATGT 


TAGAACCTAA 


360 


GTTTGCaATC 


TTAGATGAAA 


TCGATTCAGG 


GTTAGACATC 


GATGCATTAA 


AAGTTGTATC 


420 


TAAAGGTATT 


AACCAAATGC 


GTGGGGAAAA 


CTTTGGTGCA 


TTAATGATTA 


CACACTATCA 


480 


ACGATTATTA AATTACATTA 


CTCCTGATAA 


AGTACATGTA 


ATGTATGCTG 


GTAAAGTCGT 


540 


TAAATCTGGT 


GGTCCAGAAT 


TAGCAAAACG 


TCTTGAAGAA 


GAAGGATATG 


AATGGGTTAA 


600 


AGAAGAGTTC 


GGTTGAGCTG 


AATAATCTTA 


TTAATACAGT 


ATCCATGAGA 


TGTTCATCTA 


660 


TATATGATGA 


AAATGAACAT 


TTATACGAAA 


TAGTAAATTT 


CATCAAGTAG 


GAGGAAAAAG 


720 


TTATGACAAC 


TGATATTTTG 


rACaTTyCTG 


AAGAACAACT 


TGTTGATTAT 


TCTAAAGCCC 


780 


ACAATGAACC 


TTCTTGGATG 


ACAGAATTAC 


GTAAAAAAGC 


TTTGAAATTA 


ACAGAAACTT 


840 


TAGAAATGCC 


AAAACCTGAT 


AAAACAAAAT 


TAAGAAAATG 


GGATTTTGAT 


TCTTTTAAAC 


900 


AACACGATGT 


AAAAGGTGAT 


GTTTATCAAT 


CTTTATCACA 


ATTACCTGAG 


TCAGTAAGAG 


960 


AAATTATTGA 


CGTAGATCAT 


TCTAAAAACT 


TAGTAATTCA 


ACATAATAAT 


ACGATTGCGT 


1020 


ACACACAAGT 


TGATGATAAT 


GCATCGAAAG 


ATGGCGTTAT 


CGTTGAAGGT 


TTAGCAGACG 


1080 


CTCTTATGAA 


CCATAGTGAT 


TTAGTACAAA 


AGTACTTTAT 


GAAAGATGCA 


GTAACAGTAG 


1140 


ATGAACATCG 


TATCACAGCG 


CTACACACGG 


CATTAGTTAA 


TGGTGGCGTA 


TTTGTTTATG 


1200 


TTCCTAAAAA 


TGTAGTTGTA 


GAACATCCAG 


TACAATACGT 


TGTGTTGCAC 


GACGACGAAA 


1260 


ATGCAAGCTT 


TTATAACCAT 


GTTATCATCG 


TTACTGAAGA 






1300 



45 (2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 1440 

TAACCCTTGT ACAACCX3AGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 1500 

5 ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 1560 

ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 1620 

CTCTTCAATG ATGTATAACT TCTTTTTCTT TACATCTATT TTAGAATGTA TAAAAGCACT 1680 

AGGATCATTA ACGTAGCCAA AGTCCAATCC AAAATAAGAA GGTAAATGTC TTAACTCATC 174 0 

TTTATTTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 1800 

ACCTAACGCA TAAA 1814 
(2) INFORMATION FOR SEQ ID NO: 459: 



IS 



(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 



35 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 

AATTnAGATT ATTACCCTCC TTTAAAAATA TTTGTTTCAC AATTTTTTAT TACCTATTTA 60 

CTGGTTTCAT GTCTTATGGG GCATTTTACT TGTTGGTATT TGAAAATGTG CAAAATTTAA 120 

30 TCTTATATGT TTCTTGGCTT TTCATGACTA TGCTATTTAT GTTTATGAAT ATGCATTCAA 180 

TTATAGATAA AAAAGTACAT ATATTCTTAA AGTCTAATAA ATAGTTACAA ATTTAGTTAG 240 

TTTTCAATTG TTAATTAGGG GTGGTAAACA GTGCTTTGTG AATCTAGACA AATTTATAAA 300 

AATCCTAAAT ATCGAGTTAT TAGATATAAT AATGAATATT TCATGGTCGA TTTAGTAAGT 360 

ACTTGGATTA CTTATTTTTT CCCTATGATT AATTGGTTTT TGCCCAAAAA ATACGCAAAA 420 

ATTAGCGAAA ATGAATTTGA AAGGTTAAAT ATAGTCGAGC CTGTTAAAAA TAATGTTTTT 480 

TGGCCGGTTG CAGGAAGTTC AGTTCTATTT GGAATTATAT TGAGAAAGTA CGGTAACTTC 540 

TTTAATGTTC AGTTTGAAAA ACAACTAGCA ATCACTGTAT TTTTTATCAT GTTAATAGGG 600 

ATGTTAATTT TTTATTTTTA TCTAAATAAA AAATTAACAT TAAAAATTTT TAATACCAAC 660 

GTGGGTAATA AGAATAGGAG TTGTAT 686 
(2) INFORMATION FOR SEQ ID NO: 460: 

so (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 
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(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1614 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1142 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 
CCTTTAGTAA ACAATCCTTC TTTAGTTTTA GTACGTTGTT CCAATCCAAA TAATTTGTAT 
TTCATTGCCT CGCCCGATTG AGTGCCGCTA AAGTTATCAT CTTTCATGIT AGGCGTGTTG 
GTAAACATGT GTATATCACT GTTTAAACGG TCTTTATAAG CTTCGGTACC TTGTACATCG 
2^ TATTGCTTAT AAATATAACC ACCATCAACA GAGCCTTCTG TlTTCTCTACC TTCGCTATCA 24 0 

GCATAAACAG TCGGTTCTAA AAACAACACG TTAGCTTCCT TTTGTTTTCT AACTTCTACA 30 0 

GGATCTAAAT TTAAATTACC TTTAATAAGT AACATAGCGT CATTTAAATC ACTCATATAG 
25 TTAGCAsymy CTGATTCAGC ATTATCATAC AAATCAATTA AAGTGATTAC TTTCTCATAA 

TCCCCTTTTC TTCTTTCGTT GTTGCTAAAT TCTGTAATAG GCATACGTTC GAAAGAGTGT 
GATTCAAAAC CGTTTTCACG TGGTGTGAGC TTCAATCCAT TTGTTCTACT GGTAAGATAT 
CTATAAACAC CGTGTGAAGT GAATAAATCA ACTGTAAACA CTTCATCTTC GTCAGTCTTG 
TCTATTGGTT TAGTTCTTAA ATATCTAACG CCTGCGATAC TATTACGTTC AATTGTATTG 
TCGTATATGA CAAAAGTACT CATTGCATCA CTCTTGTATA AACGCGTTTC ATCATCTTGG 
TTTCTAATCA TTAACTCATA AGCTTTGCCA TAAATTGACA AATCTAATCC TAAAGATCTA 780 
TTGTGTGACT CAACATCATT TAAATCATTG AACGCCTCAA TAACTTCTAA TACATCTTTG 84 0 

TCATCATCTT GATATTGAAT TGGATTACCC AAGAAATAGC CGTTGATAAA ATCGCTAATA 90 0 

TAAGATGCGT AATCATGCGC TACACGGTTA TCTGCCATGT ACTCTTCTTT GCGTCGTGTT 960 
AACTCAACTA AGTTCTTAGT TTTACCTTCG TAATAATCAC TTAACACTTT CAATCTAGGT 1020 
45 CGTTGGTAAT CCATGTGATG TTCAATGTAT TTACTTACTT CATTAACGTT TTGTAATAAA 1080 

TCGGATTCCG TCCCGTCATA TGTGTAAACA ACATTGGCTT CATCATTAAA TAAGTAATTT 
ATGTTTCCCC GTAGATCTGT ATCTGTTTCA AATTCGTTTA CTTTTAACAT TTGTTCCCTC 
CTATAATCCT AGAGATTTTA TTGTGTCAAC TTTCGAACTG AGATTTGTGC GTTTtCTAAC 
CGGTCTGTAG AATCGTTCCA CTGAATAACG CAACGAATCG ATACAATGAT TGTATGTATC 
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TATAACGAAA AAGGTTGGTT AAATGAATCA GGTGCCAAAT ACCCTGTTGA AGTTGCCATT 1080 

TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACX3TGGT 114 0 

TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACGT TGGCAGCAAG TTTAATCCGT 1200 

CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 12 60 

CA 1262 



(2) INFORMATION FOR SEQ ID NO: 4 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs 

(B) TYPE: nucleic acid 

^5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 

CCTGGCTGCT TTATCAGCAT CTACTACTTT AAAACCGAAT ACGGATAAGA GTTCTGATAC 60 

TGTTGATTTT CCTGAGGCGA TTCCACCTGT TAGACCAATA ACTTTCGGCA TAATTTCACT 12 0 

25 CTTTCTTTAT TTTTGACATA CTGGACAATA ATGACTATTT CTTGTCGCGA TGATTTTTGT 180 

TTCAATTTGA CTTCCACACA CTTTGCATAC CGGCTGCTTA TATACATTAA GATGCAATTG 24 0 

CATCTCACCA GTTTTTCCAT CAGCATGACG ATAATCTGAA ATACTTGTAC CGCCATATTT 300 

AATACCTTCT TCTAGTACTT CTCTAACATA ATAAAAAACC ATTTCTTGTT GTTGGTGTGT 360 

TAAGTCTTTT ACTTTTTTAT CTGGTAAAAC ACCTGCACGA AACAACGCTT CACATGCGTA 420 

AATATTTCCA CAACCTGCGA TTACTTTATG ATCCAAAATC ACTTGTTTGA TTGGTTTATT 480 

3S 

CTTATTAGAC TGTTGATGAA TTCGATTTAA ATAATACGTC AATGCTTCAT TTGAAAAAGG 540 

TTCAGGOGCT ATTTCTAAAA ATGAAGGATA AGATGCTACA GACGCAACAT TTCTAATTTC 600 

TCCAAAACGA CGTATATCTG AATAAATTAA CTTTTTGTCA TTTGACAACT CAAAAATAAC 660 

40 

ATGCCAATGC TTACGATAAT TAGGTATCAT AATATCTTCA AGTTCATCTA CAATGAAAAA 720 

ACCGCCCGCC ATACCTAAAT GACTAATTAA TGTACGTTGT TCTCGTTTAT TATCTAGCTG 780 

AAAAACGATA TATTTACTTC TTCGTTCTAC ATTTGTAATG GTATAGCCTT CCGATAAAGT 840 

45 

TTTAAAAGTA TCTAATTCT^ TTCCTTTTAT AATTGTTTCC TTGCCTTGAG CTTTACCTTC 900 

GATTACTTTA TCCGAAAATA TAACGTGTTC AATTTTTTGA TTTATAACGT AGGGTTCAAT 960 

50 TCCTCTTTTT ACATGTTCTA CTTCTGGTAA TTCGGGCATA CCATTAACCT CACTTTATTT 1020 

TGCATCATAC CAGGTTGCAC CATAACTTGA GTCTACTTTT AATGGAACAT CTArTTGCAA 1080 
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AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 
AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATC TGCAATATGA TGATACAACA 
ATAATGAGTA ITTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCG 
AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAA7TGGGA TGTGGATATA 
TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 
TAAcGrGCAT AGTGCATAAn yACTwCAtTT TaTTaaATTG AGaGGgGCAC GATAGGTGCA 
TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 
(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



720 
780 
840 
500 
960 
1020 
1060 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 
25 ATAATnACTA AATACnAAAG TTTAACTGTC TTACTAATAA TGACTATGTT ATAATTTTAA 

AAGTGATATT TTGGGTAATC GCTATATTAT ATAGAGGAAA GTCCATGCTC ACACAGTCTG 
AGATGATTGT AGTGTTCGTG CTTGATGAT^ CAATAAATCA AGGCATTAAT TTGACGGCAA 
TGAAATATCC TAAGTCTTTC GATATGGATA GAGTAATTTG AAAGTGCCAC AGTGACGTAG 
CTTTTATAGA AATATAAAAG GTGGAACGCG GTAAACCCCT CGAGTGAGCA ATCCAAATTT 
GGTAGGAGCA CTTGTTTAAC GGAATTCAAC GTATAAACGA GACACACTTC GCGAAATGAA 
GTGGTGTAGA CAGATGGTTA TCACCTGAGT ACCAGTGTGA CTAGTGCACG TGATGAGTAC 
GATGGAACAG AACATGGCTT ATAGAAATAT CACTACTAGT TTAGCTCTCC TAGATGATGG 
AGAGCTTTTT TCATGAAAAG AACACTTAAA ATTAACGCCy TGTCTTGaTA tAATGACaCT 540 
GCcTTGTTTT AAAATAGTAA GCGGATGCgT TAATGTATCA GCGATTAAAT TTGTTGGAAA 600 
TGTATAAAAA ACACAAGCTA AGAATAAAAT ACCTGTATAA AAGGAGAATC ATATATGTTT 660 
CAATTACTTG CAGTTTGTCC GATGGGATTA GAAGCTGTTG TTGCTAGGGA AATTCAAGAA 720 
TTAGGCTATG AAACAAATGT TGAAAATGGT CGTATATTTT TTGAaGGAGA CGCAAGTGCA 780 
ATTGTAAAGG CAAATTTATG GTTGCGCACA GCAGACCGAA TCAAAaTTGT TGTTGGACGT 
50 TTTAACGCAA CAACGTTTGA CGAATTATTC GAACAAACCA AAGCGCTCCC TTGGGAATCT 

ATAATTGATA AAGAGGGTMK CTTCCCAGTT CAAGGTAGAA GCGTTAAATC AACACTACAT 



60 
120 
180 
240 
300 
360 
420 
480 



840 
900 
960 
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GTAAATGTAT CGCCCTCTTT AACAGAATCA TCGATTGTGT AATTTGCTTT TAATTTTAAA 366 0 

ACATCACTTG AAGTTGCCCA AAATTCAGTT TTACCAGTAG TCTGATTAAC ATGTCCTTTA 372 0 

TCAATCGCAA TGTCAATATT TGAAAAATGT ACTTTATCAT TAACATTTGT TCCTTGTTGT 3780 

GGAGCTGCAA CAGTATTCAC TGCCATGCGA TTTAAAGTTC TTGGTTTAAT AGTCGTTGTT 3840 

TTAGGTGTAG TTGAAACATC TTTTGCTTGT GTTAAATTAC TTTTATCAGT TTCATTACTA 3900 

TATGTAGTTG ATGATTTATC ATTTGTTGTT ACATTGCTAG TTTTTGTAGT AGATTGATTA 3 960 

GCTGTAGCGT TTTGTGGTGA TTGCATGTTA CTACTAGTTT CTTTAACTGT TGCACTATCA 4 020 

CTCATTGTCA CTTTAGGCTG ATCTGCAGTT GCAGTTTGCG TATTGTCTTT TAGTTGACGA 4080 

CTATCAACTT TTTTAGTTGT TTTATTCTCA CTTGGGGCTG TCGTTTCATT TTTTGATTGA 414 0 

TTTAATTCTC CATTCGTATG TTCTGCCGCT TTAGCTTCAT GACCACTTAA CCCAAAAATC 42 00 

20 AATGTTGTCC CTACTAAAAT TGAAGCAGTA CCTACAGAAT ACTTTCTTAT CGAAAATTTG 426 0 

TTTAATCGAT TTGGTATCAT GCCTTTnCTA TTTGTnGCTG TCTTTTTATA ATTCATTTAA 4 320 

TAATACTCCT TTAAAATATC AAAATTTGAT AAATATAA 4 358 
(2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs / 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

TTGACTTCTT AATTCAGCAT TTTCTGCACT TAATGCTTTG TTCTTTTTAA TAAGTTGCTT 60 

TCTTGCATAA ACTTCGGTAT CTATTTTACT ATTACTATAC CTTTGATTTA AAACTAATAT 120 

4^ ACCAATTAAT GCTACAATGA TAATGATAAG TACAACATAA 7VAAGACATTT TTTCACCAAT 180 

CCTTTTTGAC TTCTTTAACT TTGTATACAA TAATAATTAA TAAAGATTAA TTGTTATTCA 240 

ATTTCCCACA TTTTTATTAG TTGATTTTAG TTCATCATTG TTATAATCAA ATTATAAACT 3 00 

GACAGATATT GATGTTCAAT GAATATGACG TGAAAGATTC GTGAATTCAA GTTTATGTCG 360 

AATTTATGTT ATAACGGTCA TTTAAATGAC AGAATTAGGT CACTCATAGT ATTTTGAAGA 420 

TTGAATTCAT TAATTTTAAA ATGTATAATG ATATTTGTGA AAGCGCTTGC TTAGGAGGTG 480 

50 

TATTTGAGAG TGAATGAAAT GAATGCTAAA GAACAATTAG TGGACAATTT AATGAAAACA 54 0 

TCATCGCAAT TATTTAAATT TCACGGTGAA GTTGCCATGC AGCTTTTCTT AAATGATGAA 600 

55 
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TCTGAATCTG AGTCGCTGTC TGAATCTGAA TCACTGTCTG AGTCTGAGTC GCTGTCTGAG 
TCTGAATCGC TGTCAGAATC TGAGTCGCTA TCTGAGTCTG AATCTGAATC ACTGTCTGAG 

5 

TCCGAATCGC TATCTGAATC TGAATCGCTA TCTGAGTCTG AGTCGCTATC CGAATCTGAG 
TCGCTATCTG AGTCTGAGTC GCTATCCGAG TCTGAATCGC TGTCTCAGTC TGAGTCGCTG 
TCTGAATCTG AATCGCTATC TGAGTCTGAG TCGCTGTCTG AATCGCTGTC TGAATCTGAG 
TCGCTATCTG AATCTGAGTC GCTATCTGAG TCTGAATCGC TGTCAGAATC TGAGTCGCTA 
TCTGATGTTT CTTCTTCGTA GTAGCCATTA TCAAGTGTGA AATCATCATG ATCCGTAATT 
,5 GTTACATCAA CTTCGCCACC ATCGGCATCT TTATCATCTT CAGTTCTATT TGTACCTGTT 

TGAGTTAAGC CAGCAGGTTT TTCAAAGATA ACTTTGTATT TACCACTATC TAAATTATCA 
AAGCGGTATT TACCATTTTC ATCTGTyTCA GTTGTACCAA TTACTTCGCC TTTTTCGTTT 
'0 TGCAAAGTAA CTTTAACACC TTTAATTCCT TTTTCAGTCG AATCTTGTTT ACCATCTTTA 

TTACTGTCGT ACCAAACATA ATCACCTAAA CTATATTTTG GTGTTTTGTA GAATCCACTA 
TCTAATGTCA TGTTGTCAGC GTCTTTAATG ACACCTGTTG TAGTTAGTCC ATCAGAATCT 
ACAGCATCAT CTGTACCTAC ATTTGCAGTT GTCGGTGTAT AACCGGCTGG TGTTGAAAAC 
TCTACACTAT AAGTTCCATT GCTTAAACCA GTGAACTGAT ATTTACCATT TTCATCTGTT 
GTCGTACGAT CTAATTCTTT ACCGTTACTA TCTTTAAGAA TGACATAAAC ACCTTTAATC 
CCTTTTTCAT TGGCATCTTG TTTACCATCT TTATTTGTAT CTTCCCATAC ATAGTCACCT 
AGATTATATT TCTTTTGGTC GCCATTAGCA GTTGATGAGC CATTCACATT TGAATAACTA 
3^ TTTGACCAAC TATATTTAGT TTTGTCAGTG TCTAAAGTAT AATCAATTTT TCCATTATCT 

GTTGAACTAT TATCTGGATA AGCAACTTGT TGAATGATGT ATTGTTTA-rr GCTGCTTGTT 
TGGCCTTTCA TTAAATCGAC TGTAGCTGTT TTATTATCAT TACTATAAAT AACATCGAAT 
40 TGATCAGTAA CATCTTTAAG TTTTGAAGTA TCAGGGGTGA AACTATCCAC AAATTCATTT 

TGATCTGtCA CTtCGTAAAT TTTGAAGTTT TTTGCATTTC GATTAAATTT ATATCCAGTT 
AAATTAGTAA CAAACGTTTG TTTAGTATAT GTATTTTTAG GTTGATTTAC ATATGCAGTC 
ATATTACGCG ATAAATCTTC ATTGTTAATA TAGTTTGTAC TTGAAATAAG CGGTTGTGCT 
TTTTTATTAC CATAATCGAC AATGATTTCT TCGCTATATG TATCATTACC TAAAGTTACT 
TCCATTTTAT AAGCTGTTTT ATCAGTTGTT GCATTTTTAC GTTTCGCAAA TGCAACTTGT 
TCAAAGCTAC CTCTAACATT TGTATATTGA TCTACATAGT TCGTAAAAGT ATATGTTGTT 
GTGTTTGTTG TACTATCATA AATACCTTTT GCAATAATAT TACCTTGGGC ATTATATAAA 



4S 
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ATTTGGATCT 


TTAATATCAC 


CAATATTTTT 


AATATCTTCC 


GGATTCAATC 


CATATACTTG 


60 




TACTGTATCT 


GAGTATTTAA 


TTGTGAAATA 


ATCACCTGAT 


TTAACTTTGT 


CATCAACTGT 


120 


5 


AATTTGTGAT 


TTTAATGATA 


AATAATCTTG 


GGCTGGTACG 


ATTTTATTGT 


TTTTATCTGC 


180 




ATCAACGACA 


GTtAATGTTG 


TATTTGATGT 


GATTAAATCA 


TTAACATTTT 


TAGCCTCTGT 


240 




TGATGATGGC 


TGTACTGCTG 


CTATACGCAT 


TCTTGTATTC 


AAACGTTTAG 


GTGCTGTACT 


300 


10 


TTTTGGCAAA 


ATGATATCTG 


CATTATTTTC 


ATTATTTGAA 


TTACTATTGT 


TATCAACAAG 


360 




AGTTTCATCA 


TTACTCTTGA 


TAGCATCACT 


TTTAACATTT 


AATGTAGTTG 


ATTCAGTTTT 


420 




GGCATCTACC 


TTTTTGTTTT 


CCTCATTAGT 


TGGTTGAACA 


TTTACCACTG 


ATTTATTCTC 


480 


TTGCAAATCA 


GGTTGTAACG 


CTTCTTGATT 


ACTTATAGTT 


TGTTTAGTGT 


TTAAATCTTC 


540 




ATTCGTAGAT 


TTTGGTGAAG 


CTTGCTCATC 


TGATTTGGCA 


GTTGAAACTT 


CAACTTTATT 


600 


20 


TCCAGTGGTA 


GATTGTACAC 


TTTCTTTTTC 


TATTAATTTA 


TTCCCATTTG 


AAGTCGTTTC 


660 




ATTACCTTGa 


GATGATACCA 


TTTCTTTTTG 


ATTATCATTT 


TTAGTATTGT 


CTTCTTGATT 


720 




TAGTTGCTGC 


ATATCT^CTT 


TATCACTCGA 


TTGATTATCA 


CTTGCTGAAG 


TTGTCGCTcG 


780 


25 


TTCAATTCTT 


TATTAGTACT 


TTCTGCAGCC 


TTTGCTTCTT 


GGTTCCCCAG 


ACCAAAAATT 


840 




AATGTTGTAC 


CTACTAAAAT 


TGATGCTGTT 


CCCACTGTGT 


ACTTTCTAAT 


CGAAAATTTA 


900 




TTTAATCGAT 


TGGATACCAT 


GCCTTTCCTT 


GTTATTGCCG 


TTTTATTTTC 


TCTGTTTAGC 


960 


30 


ATTAGATTAC 


TCCTAATTCA 


TCAAATTTTT 


AAATAATACA 


ATTGTTTTAA 


ATACAAAAAT 


1020 




GTATATCAAT 


ATAGTATTAC 


ATTTTTAGAT 


AAAGCACAAT 


ACTTTAATTA 


TTTTTCTTTA 


1080 


35 


TCGTAAAACG 


TTATTTAACA 


TTTGTGTTTA 


AATAAAAGTT 


TTTATGAGTt 


TTGTAATCTT 


1140 


TATTTAATCA 


TCATAAAAAA 


TAGTATTATT 


TGCCCTTGAA 


ATTAATATCT 


TAGCTTTTCT 


1200 




AATTCATAGA 


CAATTACATT 


TCTGTAACAA 


ATTAAATTGT 


ATCTATTCCT 


TAAAGATTTT 


1260 


40 


riXJTITlATA 


TCTGGGAATT 


TCTAAACAGA 


AAAAACCAGG 


CCACATGGAC 


CTGGTTAAGT 


1320 




TAATCATATT 


ATTTATTTTG 


m-ri-rACGA 


CGACCGAATA 


ACAATAATGA 


TCCTAATGCC 


1380 




GCGAATAATC 


CACCGAATAA 


TGTGCCATTA 


TTTGAATTAT 


TATTTTCACT 


ACCTGTTTCT 


1440 


45 


GGTAATGCTT 


TAGCTGTTTT 


ATGCTGATCT 


TTAACCX3TAC 


TCATTGGTTT 


AGCCGGAGTA 


1500 




TGTTTACCTG 


CATCTGAATC 


TGAATCGCTA 


TCTGAATCTG 


AGTCGTTGTC 


TGAGTCCGAA 


1560 




TCGCTATCTG 


AATCTGAGTC 


GCTGTCTGAA 


TCTGAATCGC 


TATCCGAGTC 


TGAGTCGCTA 


1620 


50 


TCTGAGTCTG 


AGTCGCTATC 


TGAATCTGAA 


TCGCTGTCTG 


AGTCTGAATC 


GCTATCTGAG 


1680 




TCTGAATCGC 


TGTCCGAATC 


TGAGTCGCTA 


TCTGAATCTG 


AATCGCTATC 


TGAATCTGAG 


1740 
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ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 34 8 0 

CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 3540 

TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTGTACACC 3 600 

CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 3 660 

TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 3720 

GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 3780 

GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAATCAT 384 0 

GAGACTTCAA AA?VGCACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3 900 

TAATCTATTA AAAGACTTAT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3 960 

GATTTGGATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 4 020 

AAACAGTCCT CTAGTTTCAT CAGTGTTCAC AACATTTTTC ATGTCTGGCT TTTTAGGTAC 4 080 

TACTTATTTA AATACATTTT TTAGTAACAT AACTTTTATC AATAGCTTAA TAACX3CCTAT 414 0 

TTGGATTTTA TGCCTTGTGG GAATTATGAC GCATATGATT ATTTTTTCAA TAAAATATTT 4 200 

2S AAAAGATTTT TCACTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTTA TTGGTATTGn 4 260 

TATCGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACAGTAAT 4 3 20 

ATATGGCTTT GTAGCTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 43 80 

TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCACCAT TTTCTTTAGy 4440 

CGCTGCAGCA TATGTTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTATATTTTT 4 5 00 

ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4 560 

ACCTTTTTCG CCCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTCAG CAACTGCTTT 4620 

AAAGAACAGT TTGCCTGTAC TTATGTkTCC AGACATTtGG AAAGGkCTTT TGTTTATCGA 4680 

AGTGTTATTA GCCACTGTAA TAGTACTTAG AGTCTTTATA GGATATCTTC ACTTCTTTTT 4740 

AAAAAAGGAA AAACAAGATA AATTTCTnCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4 800 

ATTAACACGT ATATTTAATA 4 820 
(2) INFORMATION FOR SEQ ID NO: 454: 



30 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
SO (D) TOPOLOGY: linear 
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ACGGCGTCTG ATTTAACCTT CTCTTTTGCG TTTCX3GTTAC ACCATTAGTA ACAATATATA 1680 

AATCATGTCT TTTCGATAAT TCGACAATTG TTTCTAATGT TTGATCAAAG TATTTAACTT 174 0 

TAGCTTCTGC TAATCCATTT CTAAATAACA CATCTGCACG ATGCCCATCA ACTTCCATTT 1800 

GATGATGTTT GAAGTAATTC ACT^TCGTT CTGATAATAC TTCAGACTTC GkTAATTTAT 1860 

TTTGkTGAAA AGCTTCCCAA TGTTGGTGaT TGaTTTTTTT T^AATGkTAAA AAATCATCCy 1920 

TTGTTGCTTT ATGATTAAAA ACATTCGCCA TATAGTGAAn CGCCCATTCT TCTGcATCAT 1980 

AAAAATCAAC AATTGTATCA TCAAAGTCTA TCAAAATATT TTTATATCCC AATTTCCCCA 204 0 

TCTCCTATAT TGTCTATGTA TCTAAATCTT AACAGAGGCT CAAATTTCTG CAAATAAAAT 2100 

AAACTGAGTG CATAACATTA AAGTATGCTC ACCCAGTTTA TTTTAAAGAA TATTAGTTAT 2160 

TATATTAGAA TCCAAATAAT TTACCTAGTA AACCCACACC GTTAGCAACG ATGTCTACGA 2220 

TACTTGTGCC TAATTTCACA CTATCATGTT GTTGTGCAGC TTGCACAGTA TTTGCGATTG 2280 

CTTCTGCTAG TCCAGTCATT TAAATCTCTC CCTCACCTTT GAAATAATAC TGATTACTTA 234 0 

CATAACATAT TGAAATTAGA ATCCGAATAA TTTACCTAAT AAACCTACGC CATTTTCAAC 24 00 

GATGCTCACA ATGCTTGTGC CTAATTTTGC GCCATCATTA TTAATTGCTG CAGTTACGGT 24 60 

ATCTTTAATT GCGTTAAATA AACCTTCCAT TGAAAACACT CCTTAAAATT TAAATTTGAA 2520 

GATAACAAAA ACGTGCGTAg yTTTTAAATC ACCGAAATGT TATTCGCTTA ACGTTTTGTT 2580 

30 GTTGTTATTT TAPiAATPJKAT TTGATGCAAT TAGTTTGTTT ATCCGCACAA CATCTTATAA 264 0 

TGTACTTAAC TGTATTTTAA AGAGAAAAGA AATACAGTTA GGCATTCAAA ACTGTATTTA 2700 

ACACAATTAA GTTGCCTGAA TTCGTATTTA AGTCTTATTG AACCTTTTTA GATAAATAGC 2760 

35 TCTATAATAG TGAAAAATAT AAACATTTTT TATTTACAAG GTATTGCTAA TTTAAGTTCA 2820 

TTTAGATATA ATAATTCTTG TGTTGTTAAA CGTGTCCTGG TAGCTCAGCT GGATAGAGCA 2 880 

ATGGCCTTCT AAGCCATCGG TCGGGGGTTC GAATCCCTCC CAGGACGTTT ATAGGTATTT 2940 

TTATACGCAT TACCAAACAA AAGAGTTCCG TGATTACX5GG GCTCTTTTTG TTTTGAATTT 3000 

CAGTAATATA GTATGATGCG TCACCAAAAC GTCCCCCGCA TAAGCCCCGA AAATACAGTA 3060 

ATTAAAACAA GCATGCTTAT TCGTTATAGA ATTTTTTGAC ACACAATTGA CACGCGTCTG 3120 

ACACTTGTTT ATACATTTTT AATTAAGTAA TTTTGTGCTC AAATTTCATC TATACTGCAC 3180 

CTGAACTACA CCAACACTAC ACCAAGATTT TTAACACTCA CCATTTGCAT GCGTAGAGAT 324 0 

TTTTATTATT ATATTATTCC TATAGATTTT GATACTATTC AAAATTTTAG GGACTTTTCA 3300 

GGGGCCCGAA ATCCTATAAT TATAATTATA TACATCTAAA AAAAATAACC ACGTCCATCG 3360 



25 



40 



45 
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(C) STRAhfDEDNESS : double 
CD) TOPOLOGY: linear 



5 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


453 : 








TCTCCAGTAG 


ACCTTGTGTA 


TGAACAGTTT 


CTTTCATATG 


AATGAACATC 


GTTTTTAAAG 


60 


10 


TTTGTTTCAA 


CTCAGCCTTA 


TCAGGATAAT 


ATCTAGAGAC 


AGTCGTCTCT 


GGCATCTCCA 


120 




TTGTATGATA 


TTTAACCTTA 


TGCAGCGACC 


CATGATTTTC 


GTTATAAACA 


AATGTATGAT 


180 




TCACTTCATC 


GAAATCATGA 


TCTTCTCCTG 


CAATCCAAAA 


AACTGGTACT 


ACTTGTTGCT 


240 


15 


TATGTGTATC 


CGTTAATTCC 


TTAGATAAAG 


TAATGATTGa 


AAATATTTTA 


TGGAATGTAT 


300 




ACAATGGTCC 


CCCGAAAAGC 


CCTGCTTGTT 


GTCCACCAAT 


CACAACTTTT 


GAACCATTAG 


360 




CTAAATGTTG 


TATGTTTAAT 


TCTTGTTCAC 


TTGAAAGCTT 


GAAAGCTTTA 


AATCACTCAT 


420 


20 


ATATTCA_CGA 


pjv AACATTCG 




ft 1 J /T.**.''*^** I T * o ^ * 


TTATTTTCTT 


TAGACATC CT 


480 




•iTTiTCAAAA 
ACTGCTCTTT 


CTAGTTTGTT 


GAGCTGCATC 


ATATTGAAAT 


AATCCTGTTA 


TTACAGGGTC 


540 


25 


ATTTTTGGTA 


TAAACTGATC 


TTTTTCATTT 


AAACTAACTA 


CTTTACAGTC 


600 


CATGTTTTTT 


CTCCTTAAGT 


ACGCGATTAC 


AAACTATAGT 


ATAAAGTCTA 


TACCGGTGAT 


660 




TGACAATTTT 


ACGGCTTGAA 


AATCAATTTA 


ATCATGGAAA 


ATTTATAATA 


TTCATTGTTT 


720 


30 


TACATTTTCA 


AATCAATGAA 


AAACACAAGT 


GGTTTAATGT 


ATAATAATAG 


TAGTAAACAA 


760 




ATAAGGGGTA 


GATAAATATG 


AGTGAAATCA 


AACGTCTTGA AATTAATTAC 


AAAACTGACG 


840 




AATTATTCGA 


AAACTTTAGA 


GCGTTTGGCA 


ACAAAGACTT 


ATACATGGTC 


AATGAGTTAA 


900 


35 


ACGGTCAAAT 


GATTGATGCA 


AGTTCAGATT 


CACCATTTTA 


TGGCATATTT 


GTCGaGATCA 


960 




ATTAGGAGCT 


AGAATGGCAT 


TACTAAAAAA 


AGGTGATGTC 


GAAGAAATCT 


ACTTCCCAGA 


1020 




TTTTGAAGAT 


TATATATTAT 


TATGGAAGTT 


AGAAGTATTA 


CCAAAATATC 


AAAACAGAGG 


1080 


40 


GTACGCTTCA 


GAATTGATTG 


ATTTTGCAAA GAGTTTCAAT ATGCCAATTA AAGCCATTGG 


1140 




CAGAAATGAT 


TCTAAGGATT 


TcrrriTACA 


TCATGGATTT 


ACAGATGTGG 


AAGCTAAAAA 


1200 




TATAGAGGGA 


CATGATGTCT 


TATTGTGGAA 


ACCATAAGAT 


AATAATATTC 


GACACTACGA 


1260 


45 


GCATGAAAAT 


GCATCTTTTC 


GTAGTGTCTT 


TTTTACAATT 


ACTTTCTTAA 


GCTAATATAA 


1320 




GTaAATCATT 


TTCAAATTAT 


TTGTCTTAAC 


GTACAATATC 


ATTTAGTTGT 


TTCCATGrAT 


1380 


SO 


TAATTTCATA 


ATCAGGTATA 


ATTCCTGGAT 


TATGATCAAA 


TCCTCTAAAa 


TTAAACCAGC 


1440 


AAGTAGCTAT 


ACCcGCATTG 


ATTCCACCTA 


GAATGTCAGA 


TGTTAnAGAA 


TCTCCclACTA 


1500 




TAATCGAGTG 


CTGtCTTTCA 


TCCTCACCAA 


TATCATTAAA 


AACATAATTA 


AAAAATTCCG 


1560 



55 



1366 

BNSCOCtD: <EP 078651 9A2_I_> 



EP0 786 519 A2 



AATGCGTCTT TTGAAAAATG GTCATTAAAG GCATCAGATT GCTTAAAGTC TTCGTATGCA 420 

TGTCGATCAG CAAATCCGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 4 BO 

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 54 0 

TCAAGTTGAT ATGCATGATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 600 

TCACtGAATT CACCAA 616 



10 

(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 4 base pairs 

(B) TYPE: nucleic acid 

^5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 

GCACQAGTGA TTAAACGGTT AATCAATGAA ACATTTGATG CAAATTACAT TGAAGTTATT 60 

GAGGGAGGAA TTGAAGAAAC GCAAACGTTA ATTCACTTAC CTTTTGACTA TGTCTTCTTT 120 

25 ACAGGAAGTG nAAATGTAGG CAAAATCGTT TATCAAGCTG CCAGCGAAAA TTTAGTTCCT 180 

GTGACATTAG AAATGGGCgG TAAAtCTCCa GTCaTCGtTG ATGAAACAGC GAATATTAAA 240 

gTTGCTAGTG AGCGCATTTG TTTTGGGAAA TTCACTAATG CCgGCCaAAC ATGTGTTGCa 300 

CCAGATTACA TTTTAGTACA CGAATCTGTA AAAGATGATT TAATCACAGC CCTATCAAAA 360 

ACGTTGCGTG AATTTTATGG TCAAAATATA CAACAAAGTC CAGATTATGG CCGCATTGTA 420 

AACCTTAAAC ATTATCATCG TCTGACTTCA TTACTTAACA GTGCACAAAT GAATATTGTA 4B0 

35 

TTTGGTGGTC ATAGTGATGA GGATGAACGT TATATAGAAC CAACATTGTT AGATCACGTT 540 

ACAAGTGATT CAGCAATTAT GCAAGAAGAA ATTTTTGGTC CTATCTTACC GATTTTAACG 600 

TATCAGTCAT TGGATGAAGC AATAGCCTTT ATTCACCAAA GACCAAAACC TTTGAGTTTA 660 

40 

TATTTATTTA GCGAAGATGA AAATGCTACA CAACGTGTAA TAAACGAGCT ATCATTTGGC 720 

GGCGGCGCTA TTAATGATAC ATTGATGCAC CTAGCGAATC CTAAATTACC ATTTGGTGGT 780 

45 GTTGGTGCCT CAGGTATGGG ACGCTATCAT GGTAAATATT CATTCGACAC TTTTACACAT 84 0 

GAAAAAAGCT ACATTTTCAA ATCnACACGA TTAGAATCAG GTGTCCATTT ACCACCATAT 90 0 

AAAGGTAAAT TTAAATrCAT CAAAGCTTtC tTTAAAnATT AATT 944 

50 (2) INFORMATION FOR SEQ ID NO: 453: 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 4 820 base pairs 
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CAGCAATATC 


TATTTGATTr* 

•a»X iX ^^J^^IX A 


MX XWy^i.urv.>v\ 


nAC^ 1 U X 1 v. 


i» X AAx GAAAT 


ATATCACACT 


840 




GAATCCAAGT 


CATTTCAAr'A 


J- v-r\ X ^ X o 1 X X 


K.X X X/\X X^XV_ 


x\34jATATTTT 


GATATAGCAA 


900 


5 


AAAGTTCAGC 


ATCATTTTCA 


ATCAPTT PA P 


"1 — I'R R ATAPTT 
X Xxxrw-\XnV_X X 


A PC A liT*R*T*I\T« 


CCTGTTCCAC 


960 




CTGCTAATAA 


AACTTTAGGT 


X X ^^.AX v» X 


AX/\\k.X^wX X X 


AAAU HaTAAC 


CAAAAAACAT 


1020 




ATTACTCCAC 


CTTTTAGTTA 


CATATATATT 

^hTV X/A X X>^ X X 


AXAAXaPTAP 


^— AAA 10* X ill 


ft ft ft TV fT^n**!^/^ ft ft 

AAAATTTCAA 


1080 


10 


AATACTGGAG 


GCri'T'l^rATG 


GPPPA TATTA 
v7^^v_nx/AX x^\ 


T A PPT 2VP li r2T 


*P li ft ft ft ft 

i Avj i A 1 CAAA 


O ft T'^T' ft ft ft ft 

GATGTAGAAA 


1140 




ATTTCATTTC 


AATGTTAGCG 


AAPATAT A Pn 


A PP ATPTPr* 


vj X A X A 1\» i 1 C 


X ACACACCAG 


1200 


IS 


GAGAATATGA 


TCCTAGCGTr 


APATPT^PTIi 
r\x v..vjrvjv_ Xx\ 


PTA 2i A/^Ii liTT 


ftr^ft ft/^ft ft^ftn^ 
AvjAACjAATAT 


ATCACTTCTC 


1260 


CGCATAAAGT 


Vv'^ X W X A X X 


f3 PTY3 A A A r; TTP 


ATV^ A Ar*A ik/'^nr 


CGTTGGCTTT 


GCCTTTGTTA 


1320 




ATACGACACC 


ATTTCAAPflP 


ATTA A APaTT: 


X X\3V.XAAA/\1 


TGATTTAGGT 


GTAAAGAAAT 


1380 


20 


TATATCAACA 


TCGTGGAATT 


PnPPA APPAP 




CATTATGGCT 


TGGTGTTTAA 


1440 




ACAATCT^AAT 


APA PP(^ A fi-TT* 


r^A aPP^ ^ RTO 


X ACt-At, I CAA 


TAACCAAC ct 


GCCCTCGAGC 


1500 




'rrriT AAAAG 

X X X X X ^W^tAVJ 


XtJWVvVJAX X X X 


Zi & R 'V/^r^ TV 7i o 
V«r\A/\ X V^oA/iVj 


(jrt-tjl XxTAAA 


AGATAAGTTA 


TTTATCGATG 


1560 


25 


GTAAATATTA 


TGATGACTAT 


ATGATGGCTA 


AAATTCTTAA 


TTAAAGCTAT 


TTTATCATAA 


1620 




TCTTGTATCA 


GAATCGTATA 


ACAACGAATT 


TAATGGTTAC 


CTAATACATT 


ACTCATACTT 


1680 




ATCAATGTTA 


TCTAATCTCA 


AATAAATACG 


TACACTCTTA 


TTCATTTATC 


AAATTTAAAT 


1740 


30 


TCAAAATAnA 
TATATATGTG 


ACACCACTAA 
GTTAGTTTTC 


TGTGTAATTG 
CTnACTA 


ATTAACTATC 


AACTACGATT 


AGTGGTGCTT 


1800 
1827 



(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dotible 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 
ATATAGATTA ATGTTTGTTA TTTGTACTGT GTTTGCCAGT GACATTTATT ATAAAACATA 60 

45 

TGACGTTATT CTATGkTGTA CTCATTtTCT ATATTGTAGG TTTATTAACG ATTAGAAGTA 120 
TTATTAAAT^ GTTGAAATAT CAGGAAACAT TATTACGAGA CTAAAAAACT TCCATTGGCA 180 
so TGTATGTTGT AAAGGTGCAT GTAATGTTGA ACGCCAAATG ATACGGCGTT CAGATTACAT 24 0 

TAGCATCTAT ACGTT/U^CAG CATAACCAAT GGAAGTTTTC TTCGAATCTA TTCTTTTATT 3 00 
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TCGTAACATT ACGAcAGGGG AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 960 

AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 1020 

TGCAATATAT CATCAGCCAC TTTTAGCATT AATCX5TTGCA GGAAGTTTAA CTTGTGCGAT 10 80 

GACGGTGGGG ACGTTTGTAG GTTCGATGAT TCCATTATTG ATGAATAAAT TAAATATCGA 114 0 

TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 1200 

TTATTTTGGT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 12 60 

TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 13 20 

AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG TGATTGGAAA 138 0 

TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA CGTTAGGCTT 144 0 

TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 1500 

ACGCGCACGT CAAGGACAAA ATGATGATG 1529 
(2) INFORMATION FOR SEQ ID NO: 450: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1827 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



^0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGCATCCAT ATCACCCAAT 6 0 

ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 120 

GGCAGTCAAC TTCAAAGCCA TGAATCATCA TTTTT T GCAT TGGTGCTTGT ATAAAGTAAT 180 

AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 24 0 

CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTTTC ACTAAACTTC 300 

CACTCAATAA AATTAAAGTT ATTATTCCAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 360 

GCGGTCGCAC CTTATTCTTA ATATATATCT TAAATTGATC ATGTGATTTT TCTGTTTTCA 420 

CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 4 80 

TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 540 

GTAATTCTAC ATTAACTAAA GAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 600 

SO CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 660 

CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 720 



55 



SOOCID: <EP 07e6519A2J.> 



1363 



10 



IS 



20 



25 



30 



35 



40 



45 



EP0 786 519 A2 

TATGGCTTGT ACCATGGAAA CCATACTTAC GAATGCCATA ATCTTTATAA TAATGATATG 900 

GCAAGCTATA TAGATATGCT TTTTCAGGCA TTGTTTGATG GAATGCTGTA TCAAAAATTG 96 0 

CCACATGAGG GATATTTGGT AATAATTTAC GGAAAGCACG AATACCCAtC AAGTTaGCTG 1020 

GGTTGTGaAG CGGTGCTAAT TcGCTTAATT CTTCAATTTC CTTTTCAACC TCATCAGTAA 10 BO 

TAGCTACTGA TTCAGGGAAT TTTTCACCAC CATGTACAAC ACGGTGACCT GTTCCATCGA 114 0 

TATCGTTAAT ATCATTAATA ATATTGTGCG CTTTAAAAGC ATCCAACATG ATATCAACTG 1200 

CCTCAACGTG ATCCTTGATA TCTTGTACTG TTTTAACTTT TTCCCCGTTG ACTTCAATTG 1260 

TAAAAATTGA ATCCTTCAAT CCGATTCTTT CTACTAAACC TTTTGTTACT AATTCCTCTT 13 2 0 

CAGGCATTCT AATTAATTGA A 134I 
(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1 52 s base paix^s 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
.(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

TTTTGAAGAT ACTACCGATG AAAATAGACA AAAGATTTTT CAATATTTAT CACCTGAAgA 6 0 

AGTTGCAAAT TTCTTTGATC AATTAGATAT TGATGACGAT GAATATGAGT TGCTATTTGA 12 0 

TAAGATGAAT GCGACATACG CAAGTCACAT ATTAGAAGAA ATGTCATACG ACAATGCAGT 180 

AGATATTTTA AATGAGTTGA CTAAACCAAA AGTTGCTAGT CTTTTAACAT TGATGAATAA 24 0 

AGATGACGCG AATGAAATCA AAGCATTACT TCACTATGAT GAGGATACGG CCGGCGGTAT 3 00 

TATGACGACG GAgTATTTAT CACTTAAAGC GCATACGCCT GTTAAAGAAG CATTATTATT 36 0 

GGTCAAAGCG CAaGCACCAG ACGCAGAAAC AATATATGTT ATATTTGTCG TTGATGATGA 420 

TGGTAAATTA GTAGGTGTTT TATCGCTAAG AGATTTAATT GTAGCTGAAA ATGATGCTTA 4 80 

TATTGAAGAT ATTATGAATG AACGTGTCAT TAGTGTGAAT GTAGCAGACG ACCAAGAAGA 54 0 

TGTTGCTCAA GTTATGAGAG ACTATGATTT CATGGCTGTA CCTGTTATAG ATTACCAAGA 6 00 

ACATTTGCTT GGTATCATCA CGATTGATGA TATTTTAGAC GTTATGGATG AAGAGGCTAG €60 

TGAAGACTAC TCTCGTTTAG CCGGGGTATC AGATATCGAT TCGACTAATG ATTCAATCAT 720 

50 rPJkPJKCP.GQA TTAPJKACGTT TACCATGGTT GATTATTTTA ACATTTTTAG GAATGATTAC 7 80 

TGCGACAATT TTAGGGAGAT TCGAAAAAAC ATTAGAAAAT GTAGCGCTAC TCGCAGCGTT 84 0 
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TAAATTTTAA TACTGCGGGG TGTCTTAAAA TGCACATTTT AGTAACAGGG TTTGCGCCTT 1260 

TTGACAATCA AAATATCAAT CCCTCATGGG AAGCTGTGAC TCAACTAGAA GATATTATTG 1320 

^ GCACACATAC AATCGATAAA TTAAAACTAC CAACCTCTTT TAAGAAAGTA GATAATATTA 13 BO 

TAAATAAAAC GTTGGCATCT AATCATTATG ATGTTGTACT AGCTATAGGA CAAGCTGGTG 144 0 

GTAGAAATGC CATTACCCCA GAACGTGTCG CCATTAATAT TGATGATGCA CGTATTCCAG 1500 

10 

ATAATGATGA TTTTCAACCT ATTGATCAAG CCATTCACTT AGACGGTGCG CCAGCTTATT 1560 

TTTCAAATTT ACCaGTTAAA GCAATGACTC AAAGTATTAT TAATCAAGGA CTTCCTGGAG 1620 

CACTTTCAAA TAGCGCAGGT ACATTTGTTT GTAATCACAC ACTTTATcAC TTAGGTTATT 1680 

IS 

TACAAGATAA GCATTACCCT CACCTACGAT TCGGATTTAT TCaTGTGCCA TACATACCAG 1740 

AGCAGGTCAT TGGTAAACCC GATACACCAT CTCATGnCCA TTGAGGAAAA GATnAGTTG 17 99 
(2) INFORMATION FOR SEQ ID NO: 44 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 
ACTTGGTTTT TTATTGTTTA TAAATAAAAC TCACTTAATA ATGTTTTCAT AATCTTCTTC 60 

GACTACTTAA TTCTTTAAGA TATTCGTGAA AAGAGACATT ACACTAGTTA ATTTTCAAAC 120 

AATACAAAAA GCGTCTACCT CCTACATATA ATTGTAGCGG AGATAGACGC TTAATATTTA 180 

35 

TTTAAAAATT ATTTTAAACC ACCGAATGTC ATAACATCAC GGGCAATCAT ACTTTCTTCA 24 0 

TCTGTTGGAA TAACGACAAC TTTAACTGGT GAATGAGGAT AGTTAATAAA TCCTTCTTTA 300 

CCACGTAGTA AGTTTTCATT TTTCTTAGGA TCCCAGTAAA CACCCATAAA TTCTAAGCCT 360 

40 

TCAAGAACTT TCGCACGAAT TTCTACTGAG TTTTCACCGA TACCTGCTGT AAATACGATA 420 

ACATCAACAC CATGCATTCT CGCAGCATAT GATCCAATAT ATTTGTGAAT TTTAGAAGCA 4 80 

AATACATCTA AAGCCATTTG TGAACGTGCT TTACCTGATT CAGCTTCTTC TGATAAGTCA 540 

45 

CGTAAATCAC TAGATGTACC TGATAATCCT AATAAACCTG ATTCTTTGTT TAAGATTTCC 600 

AATACTTGTT CAGCAGTTTT ACCTGTTTTT TCCATAATAA ATGGAATTAA AGCAGGGTCA 660 

SO ATATTACCAG AACGAGTACC CATTGTTACA CCAGCAAGTG GTGTGAAGCC aTTGATGTAT 720 

CAATAGATTT ACCGCCATCG ATAGCTGCAA TTGATGCTCC ATTACCAATG TGACATGAAA 780 
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TTATGCAGAT TTPOkCP^TCK TTGATTTAGA TAGTGAACAA GAAATTAAAG GAGAAGATTT 
CTTATCAAAA GCAGATAATA CACCATTTAT CGGCTATAAA GTITATGGAA ATCCGATCTT 
AACAATGGTT GAAGGCGAAG TTAAATTTGa GGGGGATAAa TArTATGCAA gCAAACGTTA 
TCTAGTGTTA GAAGACGGTC TTTTACGAGG CTACCGTTAG GTCTGATAAC TTACTGA 
(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1380 
1440 
1500 
1557 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 7: 
2^ GCTAGAAATm TTGtnATGaCA ATACAACTCT GTTAAAaTGA TGGACGTAGA CAAATATGCG 

TATTGACGCT TTATTTTAAA AATTaACATG CTTATAACAT GTTTATAGAA GGAGATTAAC 
CTATGAACTA TCaAGTTCTT TTATATTATA AATATATGAC GATTGATGAC CtGAACAGTT 
25 TGCTCAGGAT CACTTAGCCT TTTGTAAAGC ACACCATTTA AAAGGTAGAA TTCTTGTTTC 

TACAGAAGGT ATTAACGGCA CATTATCTGG TACAAAAGAA GAAACCGAAC AATATATGGC 
ACATATGCAT GCCGATGAAC GATTCAAAGA TATGGTGTTT AAAATTGATG AAGCTGAAGG 
ACATGCTTTT AAGAAAATGC ATGTACGTCC TCGAAAAGAA ATCGTTGCTT TAGATTTAGA 
AGATGACGTC GATCCAAGAC ACACAACTGG CCAATATTTA TCACCTGTAG AATTTAGAAA 
AGCTCTTGAA GATGATGACA CAGTCATTAT TGATGCACGT AATGATTATG AATTTGATTT 
AGGTCATTTC CGAGGTGCAA TTCGTCCAAA TATCACACGT TTTAGAGATT TGCCTGACTG 
GATTAAAGAG AATAAAGCGT TATTTGCAGA TAAAAAAGTG GTTACGTACT GTACTGGTGG 
CATTCGATGC GAAAAATTTT CTGGATGGCT TTTAAAAGAA GGTTTCGAAG ATGTAGCTCA 
ACTTCATGGC GGTATTGCTA CATATGGTAA AGATCCTGAA ACAAAAGGTG AATATTGGGA 
CGGTAAAATG TACGTATTTG ATGACCGTAT CAGTGTTGAT ATCAACCAAG TTGAAAAnAC 84 0 

AATTATTGGT AAGGATTGGT TTGATGGCAA ACCATGTGAA CGTTATATTA ATTGCGCTAA 900 
CCCAGAATGT AATAAACAAA TATTAGTTTC TGAAGAAAAC GAAACTAAAT ATTTAGGTGC 
ATGCTCTTAT GAATGTGCTA AACATGAGCG TAATCGTTAT GTTCAAGCAA ATAATATTAG 
SO TGATAATGAG TGGCAACAAC GTTTAACAAA CTTTGATGAT TTACATCAAC ATGCTTAGTT 

TTAATTAAAT ACCTTTCAAA ACACGCTTTG AAAATCCGAT TTATAAAGGT TTTTCAAGGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



960 
1020 
1080 
1140 
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CmTyCAAArG AcACaGACAT TACTAGCCAA CXSATTTAGCT ATnACTCCAA ACCTTCCATT 
GGATTTGGTA AAGGnT 

(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



2160 
2176 



15 



20 



25 



30 



35 



40 



45 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

AAAAGCATGG CTTAAATGAA GTACGCTATA ACAAATTACA AGAACATGCT ATTGTTATGC 60 

ATCCGGCACC TGTGAATAGA GGAGTAGAAA TACAAAGCGA TTTAGTAGAA GCTTCAAAAT 120 

CAAGAATTTT TAAGCAAATG GAAAATGGCG TTTACTTAAG AATGGCAGTC ATTGATGAAT 180 

TATTAAAATA GGTAAGGGGA CGAAAATGAT GAAATTAATT AAAAACGGTA AAGTATTACA 240 

AAATGGCGAA TTACAACAAG CAGATATTTT AATTGATGGT AAGGTAATTA AACAAATTGC 300 

ACCTGCAATT GAACCAAGCA ATGGTGTTGA CATCATAGAT GCGAAAGGTC ACTTTGTGTC 360 

ACCTGGATTT GTCGATGTTC ATGTTCATTT ACGTGAACCT GGTGGTGAAT ATAAAGAGAC 420 

AATTGAAACT GGTACTAAAG CTGCTGCTAG AGGCGGATTT ACAACTGTAT GTCCAATGCC 480 

TAACACAAGA CCGGTACCAG ATTCTGTAGA ACATTTTGAA GCTTTACAAA AATTAATCGA 54 0 

TGACAATGCT CAAGTACGTG TATTACCTTA TGCTTCAATT ACAACACGTC AATTAGGTAA 600 

AGAATTGGTT GATTTCCCAG CACTAGTAAA AGAAGGTGCC TTTGCGTTTA CAGATGACGG 660 

TGTAGGAGTA CAAACTGCAA GCATGATGTA TGAAGGCATG ATTGAAGCTG CAAAAGTAAA 720 

CAAAGCCATC GTAGCACACT GTGAAGATAA TTCATTAATC TATGGTGGTG CAATGCATGA 780 

AGGGAAACGC AGTAAAGAGT TAGGTATACC AGGTATTCCA AACATTTGTG AATCTGTTCA 840 

AATCGCAAGA GATGTACTAT TaGCTGAAGC AGCAGGTTGT CATTATCATG TATGTCATGT 900 

TTCTACTAAA GAAAGTGTTA GAGTCATTCG TGACGCTAAA CGOGCAGGCA TTCATGTTAC 960 

AGCTGAAGTT ACACCACACC ATTTATTGTT AACAGAAGAT GATATTCCTG GTAATAATGC 1020 

CATTTATAAA ATGAATCCAC CATTGAGAAG TACTGAAGAT AGAGAGGCTT TGTTAGAAGG 1080 

GTTACTAGAC GGTACAATTG ACTGTATCGC AACAGAcCAT GCACCACATG CACGTGATGA 1140 

AAAAGCACAA CCAATGGAAA AAGCaCCATT CGGAATTGTT GGTAGTGAAA CAGCATTCCC 1200 

ATTATTATAT ACGCATTTTG TAAAAAATGG TGATTGGACA TTACAACAAT TAGTAGATTA 1260 
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AAGTATTACA TCTAAAAGGT ATCACAGAAG AACAACGTAA CCAATACATC AAAACATTAC 3 60 

GCGAACACCC AGAACGTGCA CAAGAAGTAT TCTCTGAATC ACTTAAAGAC AGCAAGAnCC 420 

CAGACCGACG TGTTGCACAA CAAAACGCTT TTTACAATGT TCTTAAAAAT GATAACTTAA 480 

CTGAACAAGA AAAAAATAAT TACATTGCAC AAATTAAAGA AAACCCTGAT AGaAGCCAAC 54 0 

AAGTTTGGGT AGAATCAGTA CAATCTTCTA AAGCTAAAGA ACGTCAAAAT ATTGAAAATG 600 

CGGATAAAGC AATTAAAGAT TTCCAAGATA ACAAAGCACC ACACGATAAA TCAGCAGCAT 66 0 

ATGAAGCTAA CTCAAAATTA CnTAAAGATT TACGTGATAA AAACAACCGC TTTGTAGAAA 720 

AAGTTTCAAT TGAAAArGCA ATCGTTCGTC ATGATGAGCX3 TGTGAAATCA GCAAATGATG 780 

CAATCTCAAA ATTAAATGAA AAAGATTCAA TTGAAAACAG ACGTTTAGCA CAACGTGAAG 840 

TTAACAAAGC ACCTATGGAT GTAAAAGAGC ATTTACAGAA ACAATTAGAC GCATTAGTTG 900 

20 CTCAAAAAGA TGCTGAAAAG AAAGTGGCGC CAAAAGTTGA GGCTCCTCAA ATTC^lATCAC 96G 

CACAAATTGA AAAACCTAAA GTAGAATCAC CAAAAGTTGA AGTCCCTCAA ATTCAATCAC 1020 

CAAAAGTTGA GGTTCCTCAA TCTAAATTAT TAGGTTACTA CCAATCATTA AAAGATTCAT 1080 

TTAACTATGG TTACAAGTAT TTAACAGATA CTTATAAAAG CTATAAAGAa AAATATGATA 114 0 

CAGCAAAGTA CTACTATAAT ACGTACTATA AATACCAAGG TGCGATTGAT CAAACAGTAT 1200 

TAACAGTACT AGGTAGTGGT TCTAAATCTT ACATCCAACC ATTGAAAGTT GATGATAAAA 1260 

ACGGCTACTT AGCTAAATCA TATGCACAAG TAAGAAACTA TGTAACTGAG TCAATCAATA 1320 

CTGGTAAAGT ATTATATACT TTCTACCAAA ACCCAACATT AGTAAAAACA GCTATTAAAG 1380 

CTCAAGAAAC TGCATCATCA ATCAAAAATA CATTAAGTAA TTTATTATCA TTCTGGAAAT 1440 

AATCAATCAA AAATATCTTC TCTAGTTTTA CATCATTTTT TAAATAATTT TCGTAACAAA 1500 

CCGTGATTAA AAAGAACCGT TGATTCTCAA TCGAATCTAC GGTTCTTTTT TCATTTTCCA 1560 

40 TCAATTAAAT GCTTCTTCGC TATTTGTCAG CCCACTTTTT TACCTGCAAC TTGTTAAATA 1620 

ATCCTTACAT CGTTAACGAA TAGTTCATCA TTTAGTTGAA TCAGCTCAAC TTTATTAACT 1680 

TCATATTTTC ACAAACTATT GCGCAATCCA TTCCTTTTCC ACTACAAGCA CCATAATTAA 1740 

ACAACAATTC AATAAAATAA GACTTGCAAA GCATAGTTAT GTAGCTATAT AAACGCCTGC 1800 

GACCAATAAA TCTTTTAAAC ATAACATAAT GCAAAAACAT CATTTAACAA TGCTAAAAAT 1860 

GTCTCTTCAA TACATGTTGA TAGTAATTAA CTTTTAACGA ACAGTTAATT CGAAAACGCT 1920 

TACAAATGGA TTATTATATA TATGAACTTA AAATTAAATA GAAAGAAAGT GATTTCTATG 1980 

ATTAAAAAtA AAATATTAAC AGCAACTTTA GCAGTTGGTT TAATAGCCCC TTTAGCCAAT 204 0 

55 



30 



35 



45 



SO 



BNSOOCtO: <EP_0786519A2J_> 



1358 



10 



IS 



EP0 786 519 A2 

TTTGTAATCT TTTTAACTTC CAAATTaTCG CATATAAATA TGCTATATTA ATGATAATAA 10440 

TTATCAATTA AAAGGAGGTT ATGCTATGTC TAAAGAAGCT GGTCATACAT TTTTAGCTAA 10500 

ATTAGGAAAA ACTCX3TCTAC GCCCCGGTGG TAAAGAAGCA ACAGATTGGT TAATACAACA 10560 

AGGGGCATTT TCACAAGATA AACAAGTGTT AGAAGTGGCA TGTAATATGT GCACAACATC 10620 

TATTTATCTA GCTCATACAT ATGGCTGTCA CATTCAAGGC GTTGATATAA ATAAGAAAGC 10680 

ATTAGAAAAA GCACAGGAAA ACATTTCAGC AGCAGGTCTT GAATCATATA TTCAAGTTCA 10740 

ACAAGCGAAT GCTGTTAAAT TGCCCTTTGA TGACAATCAA TTCGATATCG TTTTAAATGA 10800 

AGCAATGTTA ACAATGTTAC CCATCGCCAT AAAGGAAAAA GCATTACGCG AGTACTACCG 10860 

AGTCTTAAAG CCTGGGGGTA TCTTGTTAAC ACATGATATT GTCATCGTTA ATGAATCACA 10920 

TGCCACACAT GTTGTTAAAT CATTATCTGC AGCAATTAAT GTCAATGTCT CACCX3CAGAC 10980 

20 GAAACTTGGC TGGTTAGATT TATATAATCA AGCTGGTTTT AATCATGTGC ATTATCATAC 11040 

TGGTCCAATG AGTTTAATGA CACCAAAAGG TTTAATTTAT GACGAAGGTA TTGTTGGAAC 11100 

TATAAAGATT ATCAACAATG CTTTGAAAAA AGAAAATCGA CCAATGTTTT GTAAAATGTT 11160 

TAAAACGATG ACTAAATTGC GTAAAGATAT GAATTATATT ACTTTTGTCG CTAAAAAAGA 11220 

GCACTAAATA TAATGCCACT AACTGTACTT TGTATCTATG TTTGACTATC ACTTTAATTT 11280 

CTTTGTGACA CTAATCATCT ACTTAACAAT ATCGTTATCG TTGATTAGTA AGTCATCAAT 11340 

TTTGGTTAAA GACTTTCATA AACACTCAAA CATTAACACT ATACATAGTT AGTGGcATTA 114 00 

TTTTTTyCTn AAAATTTTAA CmTCmCGGGr TtGGGAmCrG AAaTGrtAwT TcGCrmAACT 114 60 

TAwTcT 11466 
(2) INFORMATION FOR SEQ ID NO: 445: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 

TTACATAGTT AACACTAGTT AATCTATTAG TTAACATTAG TTAATAATTA GTTAATTTCC 60 

ATTTGTATTC TCATGTGATA AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 120 

AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCT AGTTGGGGCA GCAACAATTA 180 

CGTTAGCTAC AATGATTTCA AATGGGGAAG CAAAAGCGAG TGAAAACACG CAACAAACTT 240 
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AGCAGAGAAT 


CCATTCCGTA 


ATAAAATTAC 


GTGGTTACAA 


ATTATTTCAG 


GTTTCTTCTT 


8640 




TGCATTTGGT 


GCTTTAACAT 


ATCTTATTTC 


AGCACAACCT 


AATATGAATG 


GTTTAGCAAC 


8700 


5 


TGGATTTATT 


CTTTCTCAAA 


CATCCGTTGT 


GCTTGCTACA 


TTAACTGGTA 


TTTATTTCTT 


8760 




AAAACAACAT 


AAAACGTCAA 


AAGAAATGGT 


TATTACAATC 


ATCGGCTTAG 


TACTCATTTT 


8820 


10 


AGTAGCCGCT 


TCTGTTACAG 


TATTTATAAA 


ATAAGGAGTG 


TAGATGTCAT 


GAAAAAATCA 


8880 




GCTGTTTTAA 


ATGAACATAT 


TTCAAAAGCA 


ATCGCGACAA 


TTGGTCATTT 


TGATTTATTA 


8940 




ACGATTAATG 


ACGCTGGCAT 


GCCAATTCCA 


AATGATCATC 


GTCGTATCX3A 


CCTAGCTGTA 


9000 


15 


ACTAAAAACT 


TACCACGCTT 


TATTGATGTC 


TTAGCTACAG 


TGTTAGAAGA 


AATGGAAATC 


9060 




CAAAAAATAT 


ACTTAGCAGA 


AGAAATAAAA 


GAACATAACC 


CTACACAATT 


GCAACAAATT 


9120 




AAACAATTGA 


TTTCATCGGA 


AATCGAAATC 


ATTTTCATTC 


CTCACX3AAGA 


AATGAAAAGT 


9180 


20 


AACTTAGCTC 


ACCCATTAA.A 


TAAAGGTAAT 


ATTCGTACTG 


GTG AA3 CAA C 


w «v w X J. ^ J. 


ft j» 




AATATTGCAT 


TAGAATCGAA 


TGTTAC11"rr 


TAAAAGTTAT 


AACTTGAAAG 


GAGCGTACAC 


9300 


25 


ATGACCAACA 


AAGTTGTTAT 


TTTAGGTTCA 


ACGAATGTCG 


ATCAATTTTT 


AACAGTTGAA 


9360 


AGATATGCAC 


AACCAGGCGA 


AACATTACAT 


GTTGAAGAAG 


CACAAAAAGC 


ATTCGGCGGA 


9420 




GGTAAAGGTG 


CCAACCAGGC 


TATTGCCACT 


GCACGCATGC 


AAGCAGACAC 


AACATTTATT 


9480 


30 


ACTAAAATTG 


GCACTGATGG 


CGTTGCTGAT 


TTCATCTTAG 


AAGATTTTJ^A 


AGTAGCTCAT 


9540 




ATTGATACAT 


CATATATTAT 


CAAAACAGCT 


GAAGCAAAAA 


CGGGCCAAGC 


CTTTATCACT 


9600 




GTGAATGCAG 


AAGGACAAAA 


CACCATCTAT 


GTTTATGGTG 


GTGCGAATAT 


GACGATGACA 


9660 


35 


CCTGAAGATG 


TTATTAACGC 


AAAAGACGCT 


ATAATCAATG 


CAGACTTTGT 


CGtTGCACAA 


9720 




TTAGAAGTAC 


CCATCCCGGC 


TATTATATCT 


GCATTTGAAA 


TTGCCAAGGC 


ACATGGTGTG 


9780 




ACGACAGTAT 


TAAATCCTGC 


ACCAGCGAAA 


GCATTACCTA 


ATGAATTATT 


ATCATTAATC 


9840 


40 


GATATTATTG 


TGCCAAACGA 


AACAGAAGCC 


GAATTGTTAT 


CTGGGATTAA 


AGTAACTAAT 


9900 




GAACAATCTA 


TGAAAGACAA 


TGCCAATTAC 


TTTTTATCTA 


TAGGCATTAA 


GACTGTTTTG 


9960 


45 


ATTACGCTAG 


GTAAGCAAGG 


TACATATTTT 


GCTACTAAAA 


ATCAAAGCCA 


ACACATCGAA 


10020 


GCTTATAAAG 


TAAATGCGAT 


TGATACAACT 


GCTGCAGGCG 


ACACATTTAT 


TGGTGCATTT 


10080 




GTCAGTCGCT 


TAAACAAGTC 


GCAAGATAAC 


TTAGCAGATG 


CTATTGATTT 


TGGTAATAAA 


10140 


SO 


GCGAGCTCAC 


TCACTGTACA 


AAAACACGGC 


GCGCAAGCAT 


CTATTCCTCT 


ACTAGAAGAA 


10200 




GTAAATCAAG 


TTTAAATGAA 


TCAAACACAG 


CTATGATATG 


AAGGTTTAGC 


ATATAACATG 


10260 




CAACATTCGT 


ATATCATGGC 


TGTGCTTTTT 


TATCTTTATA 


AAACATCATC 


TATTAGAAAT 


10320 
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TGGTGTATCA GTTACAACTG TGTCACATAT TTTAAATCAT AATGATAGTC GTTTTTCCGC 684 0 

AACAACGATA AAAAACGTAC ATGCTGTTTC AGAACGTTTA GGCTATGCCC CTAATAAACA 6900 

TGCAAAACAA TTGCGCGGCA GTAAAATTCA AACTATTGGC GTCATTTTGC CTAGCTTAAC 6960 

AAATCCGTTT TTCTCAGCAC TGATGCAAAG TATTCATGAC CATAAACCAT CTGATGTTGA 7020 

TTTATGCTTT TTAAGATCTA CAGCAACTGA TTTGTATGAC AATATTAAAC ATTTAATTGA 7080 

TCGAGGTATT GACGGATTAA TTATCGCACA ATACATATCA TCCCCGGACG CCCTAAATAA 714 0 

CTATCTAAAG AAACATCATG TACCTTATGT CGTACTGGAT CAAAATGACC ATCAAGGCTA 7200 

TACAGATTTT GTTCGGACAA ATGAATATCA AGGTGGACAA CTTGCAGCAC AACATTTAAT 7260 

AGAACTCGGT CACAACCATA TGATAATTGT TGCACCATAT GACATGATGG CGAATATGTC 7320 

GACTCGTGTC GCTGGATTTG TCGATACTTT GCGCGCGAAT CAATTGCCAG AACCACAAAT 73 80 

20 CGTCCATACT GAATTATCTA AGCGCGGTGG GCTAACCATT GTTGATGACA TCATGGTTCA 7440 

ATCTGCCACT GCAATCTTCG CTATTAACGA TGAACTCGCT ATTGGCATTT TACGAGGACT 7500 

AATTGAACAT GGCATCAGTA TCCCGAAAGA TATCTCATTA ATAGGTTATG ACX3ACATTGA 7560 

TTATGCAGCG TACGTCTCGC CACCTTTAAC TACTGTGGCA CAACCTATAA CTGATATTGG 7620 

CAAAACATCT TTAACCTTAT TACTTCAACG ATTACAGC7VC TTAGATAAAT CCATTGATAT 7680 

GATTGAATTA CCAACGACTT TAAAAATTCG TGCAACAACT GGCTATCATC TTTCAAACTA 774 0 

ACTACGTATC TTCCGAAATA TACTCATCAT TGTTAGGCCC TTAGCGTTGC TTTAATGCTG 7800 

AGGGTTTTTA ATCATAATTA TTTTACTAAG AAATTAAAAT AATAATGTAT GAATTTTTAA 7860 

ATATGATTTA AACGTTTTCA GTTTTTATGA AAACGCATGC ATTTTACAAA TAAAAATGGT 7920 

AOGATGGCAC TGGTAAAACG TTTTACTAAA AACAAATCAT GAGGTGTATA ACATGAGCAT 7980 

TGTTGCATTA CTTATCGGGT TAGGCCCCTT AATTGGCTGG GGCTTCTTCC CAACAGTCGC 8040 

40 TTCAAAGTTT GGTGGTAAAC CTGTACATCA AATTATCGGT GCTACTGTAG GTACGTTAAT 8100 

CTTCGCTATT TTATTAGCCG TAGTCACATC AAGTGGCTTC CCTACTGGAA CCAATTTGCT 8160 

ATTCGCCTTA TTATCAGGTG CAGGATGGGG ATTCGGACAA ATCATTACAT TTAAAGCGTT 8220 

CGAATTAGTC GGCTCATCTC GTGCCATGCC AGTCACAACA GCATTCCAAT TATTAGGCGC 8280 

ATCTTTATGG GGTGTCTTTG CATTAGGAAA TTGGCCAGGC ATTGGTCATA AAATCATTGG 8340 

ATTTACAGCT TTAGTCGTTA TTCTAATTGG AGCGCGTATG ACAGTTTGGA GTGAACGCAA 84 00 

SO 

AGAAGCAAGT AACGCCAAAA ATTTACGTCG TGCAGTGGTA CTTCTGTTAA TTGGTGAATT 8460 

TGGATACTGG TTATATTCAG CTGCACCX5CA AGCAACTTCT ATTGATGGCC TAACTGCCTT 8520 
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TCACAATATG TCATTGCAGT TATCATTGGT TTCGTCATAT GTGCGATAGG TAATGGTTTA 
GTCGCAACAC CTGGACTTAC GATTGCAATT TTCAGTATGC CTAATCAAAA AGTTGGTTTA 
5 GCTACAGGAT TATATAAAAT GAGTGGTACA TTAGGTGGCT CCTTTGGTAT AGCACTAAGT 
ACTACAGTTT TCAGTATGTT ACAACTAAAC TATGCACCAA GTGTAGCTGC AACCGTAACA 
TTTATAGTCA GCATTGTATT GATGATCCTT GGCTCATTGT CTGCATACAT GATCATTCCA 

70 

AAAACAGTTA AATCTTAAAT ATAATAGAAG AATTATGTTT CGAAATATCT TTATCACTTT 
AAAATGATAT ACAAGAAATC CAAGAAAAAT AAGCGAACTG AATAAATAAA GATTCAAITA 
ACGCATCAGT ATTAGGATTC ACTCTAAAAC GATTAATAGT TTTATAAGAA GGTCTTTGAT 
CTTGAGCTAA CCACATCATT CGAATACTGT CATCAAGTAA TITCTCTATT CTACGACCAG 
AAAATACAGA TTGAGTATAT GCATATAAGA TGATTTTTAA CATCATCTTT GGATGATAGG 
20 ATGTTGCGCC ACGATGATGT CTGAATTCAT CGAATTTGCT ATCAGGTATC GTTTGAACAA 
TTTCATTAAC ATGTCGCGAA ATATCATTTT GAGGAATTCT AACAGAAGTT TTTATTGGTA 
GTGTAAGTTG GGCAAAGTGT CTTATTTTTT TAAAGTATTT CAAAGTAAAA TTACATGTTA 
ATACGTAGTA TTAATGGCGA GACTCCTGAG GGAGCAGTGC CAGTCGAAGA CCGAGGCTGA 
GACGGCACCC TAGGAAAGCG AAGCCATTCA ATACGAAGTA TTGTATAAAT AGAGAACAGC 
^ AGTAAGATAT TTTCTAATTG AAAATTATCT TACTGCTGTT TTTTAGGGAT TTATGTCCCA 594 0 

GCCTCTTACT CTAATTATAT TCACTATCAA TTAGACAAAA TGGCCATTTT CAAATATCAC 6000 
GCGTTGTTTC TGACCTTGAA TATATTTATT ATAATTCTCT TTTTGAAAAT CAGTTAACAT 6060 
TAATTTAGAT GTACCGTATT TTAACACTTT TTGCATTGTT TCTATTCTCA TrTTTCTAAA 6120 
TAACCATCCA TCTTTTAACA CAATACGATT AACAGCATCA TATGATAATT CTACTGTTTC 6180 
TTTAATTTCA AATGTCTTGA ATGAAATAAT CGTGCACATT AAAAACGTAT CACCAAAGTA 6240 
40 ATAAACATCT AAATCATCAC GTTTATGTTG TCCAACTAAC AAACGACCAT ATTCGAACTC 6300 
TTTTTCTGGA TATTTCAATT CTAAAAAACT AATAATCTCT TCTTCTTTTA ATTTGAATTC 6360 
CATTTAAAAA CATCCTCTCT TAAGTTTTAA CAAGCCTTAA TTAAAAATTT TTTCAATCAC 6420 
ATAGTTCAAT ATACATCATT TCGTTATGTT TTTTAATACT TTGTTCAAAA ACAAATATTT 64 80 

TATTCTTTAA AATAATGACT TTTGTATTTT TAATCACAAT AAACATTTTA AAATTCTTGT 654 0 

TATCATAATC ATTAAAAGGT ATTAACCTTA ATAATATATT CTCTCGTCTC AACCTTAATC 6600 
GTATACTTCA GACGTCTGTT TGTAGACAAT AAAAGTCATT CACGTCTTCA TATGTCATCA 
AATGTTTATC ATGATATGAT GAATATAATA ATCGGGTATA TAACTGTATG ATTAATTACA 



5700 
5760 
5820 
5880 
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CACCAATAGG ATTATCnTT ATAACCACTT CACCCTCTTT AAATGAAACT TCGACTGTAT 3240 

GTCCAGTTGC ATCGGAAACA TGATAATGCA ATGGCGGAAC TTCACCGATG TCATTTAAAT 3300 

ATACAGCTAC AACATGTATT TGGGATGCTT GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3360 

ATCCCAAAAT CCATGTCACA ATTTCATTTT GCGTAATATT CATCGCGTCC GCTTTGTGTG 34 20 

TTGATCCATA TGAACTATAA CCTCGGAAAT ATTGTGTTGA AATGGCAACG CcATGTtCAT 3480 

TAACACCATC ACCATAAATA AAACCTTCCA TATCTGTTCC TGTGCCAATA AAGCCATATT 3 54 0 

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT TCCAACGATA ATTTCTAGGC GTCACTGCTG 3600 

GCGAACCATC TAATGGATAA TCATAATCCA TCGTGCGTCC AAGAAGTACT TGATTATTTA 3660 

AAGTTTGTAT TGTGAATCCT GTGCACATTG TTCTCACTCC TCTGTACCTT CATTTACTTT 3720 

AATCACTTTC AAATAAAGCT GTTTCACTTA AACATACTAT AAAAAATCAA TTATACAAGC 3780 

AATTAATTGA TATTCATTCT CAATAACTGT GGTATGATAT GTAAGGAAAT CATGACTTAT 3 84 0 

GTGTGAGTGA ACGATCATCT ATACATCCGT TCACTTCATC TCATGACTTT CTATATTTAA 3 900 

TTTTTACAAG GAGTGACATC TGTGAATAAC ACACAATCTT CACCACGCAG TAATATTATT 3 960 

ATTGCGATTA TGTTGTCTGC ATTAAGATAT TGGTTGTTTG CACAATCATT TATTAATATA 4 020 

GGACCTCTCG TTGGTCAAAC ATATCAAACC TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4 080 

TTAACTTCCT TCGCCACAGG TATCTTCATG GTGGCTGCAG GTGATATTGC TGATAAAATA 4140 

GGACAACTGA GAATGACATA CATGGGTCTC ATAATCAGTA TGTTTGCATC TCTTCTATTA 420 0 

ATTATATCGG ACATCACTGC ACTGCTCATC ATCGGTAGAA TTTTACAAGG TCTATCAGCA 4260 

GCTATCTTGT TACCTTCAAC AGTTGGCGTG TTAAATAATC AATTTAAAGG AGAACATTTA 4320 

AGACGAGCGA TTAGTTATCT AATGATTAGT ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 4380 

ATCGGCGGTT TAATTGCCTC AAATTTCGGA TGGCAAACGA ATTTCATCAT TAGTATAGTC 4440 

40 ATTGCTTTCA TTGCCATATT GCTTCTAAAA GGCACACCTG AAAAAGTAAG TCAACATAGC 4500 

CACCX3TCATC CATTCGATTA CAAAGGTATG TCGATTTTCG CTGTTATGAT TGGTAGCTTT 4 560 

ACATTATTGT TAACACAAGG ATTCGAACAA GGTTGGTTTA GTACATTTTC AATCATTTGT 4 620 

''^ CTGAGCATTT TTATCaTCAc TACGTTGATA TTCATCATCA TCGAACGTCG ACATGAAGTA 4680 

CCTTTTATTG ATTTCTCAGT ATTACGCAAC CGTCCGTTCA TTGGTGCATT TTTAAATAAC 4740 

TTTGTTTTAA ATAGCGGTCT AGGCGTAACA GTGGTCTTTT TCATATATGC TCAAACACAC 4 800 

SO 

CTTGGTTTAt CAgcTGCGCA ATCTGGACTT GTTACATTGC CATATGCCAT TGTGGCAGTT 4 860 
GCGATGATTC GTTTAGGTGA AAAAGCAACA TTACGTTTCG GTGGCAAATT GATGCTCATC 4 920 
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CCCAAGCTGC GTATGATATC GCAACTTGGG ATTTTCTGTA TTATCTACTT TGCAAGTATG 1440 

ACGTTGGGTC TACTGCATAT TGATTACCGA TGCCACCAGA CATACGTTGG AAGTGTACGT ISOO 

GAGGCGCTGT TGAATTACCC GTACTACCTG AATATGCAAT TTGGTCACCA GCTTTGACTT 1560 

TATCACCAGC TGAAACAGTT AAACGATTAT TATGCATATA CCATTGGTAG TTATTACTGT 1620 

TCGCTTCTTT AATCGTTACT TGATTGCCGC CACCATAGTT ACTCCAACCT GCTTGTACTA 16 80 

CTGTACCATC AGTTAATGAG TAAACTGGTG AATTTTCAGG CATTGCATAG TCGACACCGT 174 0 

AATGCGCACC ACCACCGTGA TATTGTCCAT ATGGTTGTAG TTGTTTACGA CTTGTTAACC 1800 

AGCTTGCGTC TTTCGCATGA CCACTAGcTG TCGCTTTACT TGCTGATCCA CCATTTTGAT 1860 

TAGATGTGCC ATTAGGATAA TTGACCTTTC CATTACCATC ATGGCTGTTA TACGCTTGGT 192 0 

TGTTGTTACT ATGTGAATAA TAGCTCGCGT CTGGACCTAC ATTTGATTGA TAACCATATT 198 0 

GATTAATATG CTGTTGGCTT TGACTCGCTG TGTA.GTCATT GTT.ATCTCCT GCTGTTGCTG 2G4G 

GATTCACATA TGTTTGGCCG CTTCCATTGG CATTTGCATT TTTTGGATAA CAGTTATAAA 2100 

AATAATGCGT ATGTCCTTGA GCATCTACGA ATGTATAGCT ATATTCTTTA TTATCAAACA 216 0 

TTGCTTGATT CCAGTTACCA TCAGGTGTGT GATGATAATC CCCATTAGAA TCAATTGTAT 222 0 

AATAAGTACC ATAAGATACG TCTTGTGATT GTGTTGACAT TTGTGTATGT GCTTGTTGGG 228 0 

TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT GCGCCATTGT AAATGTAGCG AAsCCATCGT 234 0 

TGCAATCGCT GCTGCTGTTA ATTTTTTCAT GTATAAAACA TCCTCCATTA AAGTTAAAGT 2400 

TAGTTTTCAA TTAAACTGTA CTGCACATAC TAAAAGAATT AGACAACTGA GTAAAGGATT 24 60 

55 TAATTCTCAT TTTCCAACTA TTTAATATTC CCGAAATGTT TTACTAAACT CATTACATTG 2520 

TCATTACAAA ATAGCCATAC ATTGATATTA AAATGACATC TCTCACTGCA TTCGTTTAAC 2580 

CTTTTATAAA TTTTCAAAAT TAACAACTAA TCGTTCGTCC ATGTTCGCGA TTCAACGCTA 264 0 

ATGCATGATA GTAATCATCC ATCAAATCAT ATCAACCAAA TTCCATTATC AATCGCTATT 2700 

GATTGTCATT CAACTTTCTA ATAGTGATAT GCTTCTCAGG CTTAAAAATC GTCATATCTT 2760 

TTCTATTAAT TAAATCATCT GTGAGCTTTA ATGCTACTAA TTCATTGCTG CCATAATACT 2820 

TAATATATAA CGTTCTTGTA GTTAAATTTA TTACGGTCTG ATACATCGTA TAGTGATTGT 2880 

CAGCATCATG CGGACGTACA ATTCCAATCG GTATATTTAC CGCATCTAAT AAATAAAATG 294 0 

CATTCATTAA ATCCATTTCT TTATCATTGT TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 3000 

TTCTCACAAA GCGCTCAGTT GAAGTAAATC CACCTGGCAA TCCAAATGTA CCTGCTTCAT 3060 

TGCCTAAAGG TTCAATCGTT ACACCTTCCA ATAAATTTGC TGTTGCTGGA TAAGGAGAAA 3120 
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AATTGTGTAG CTTTGTGTAA G 5301 
(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

GAAGATGAAA GCTATATTGA AGATGATGTT ACTAAAAAGG AAGCTATTTT AAGTATGCAA 60 

ATTCCTAAAG GTTTCTCTCA AAAATTAAAA GAGAACCGTT TAT^AAGAAAC GATACAGTTA 120 

TATGGTAGAG ATGACTTTAT AGGTGGTATT GCTGTAGAAA TTGTTAGTAG TTCATTATAT 180 

20 GAGCAGCAAA TTCCTAACAT TATTTATGAA CACCTTGAGG ATATGAAACA GCATCAATCC 240 

ATCGATGCTA TCAACAAGTC CTATCATAAA CATACACCTG AATCTAAAAT CAAATTTGTG 300 

TCGCTTACTA AACAAGCACA ACACTCTATT TCAATTAGCT TAATCTTTGC GGTGATTTTA 360 

TTTGTTAGCG CTGTTCAAGT AGTACTTCAT TATCGTTTAA ACCAACAAGC AGCATTGCAA 42 0 

CGATTATCAC AATATCATTT AAGCCGTTTC AAACTATATA GTACTTATGT AATGACACAT 480 

ACGATTTTGT TATTGTTGGT ACTATTGGCA GTTAGTCTAT ATTTGTCTCA ACCACTCAGC 540 

TTAATATTTT ACTTAAAATC ACTGTTACTT ATATTGATTT ATGAGATAGG TATCGTCTTT 600 

ATCTTATTCC ATATTCAAAC AATAAGTCAT CGATTATTCA TGACATTTAT ATATGCACTT 660 

GCTATGGGTA TCGTATACTT GATTATTTTC ATGTAAAGGA GCGTAACTGA TGATAGAAAT 720 

TAATAACCTT TCAAAGCGTT ACCGTAACAA ACAGATTTTC AATCATTTAA CTATGTCCTT 780 

TGATAGTAAT CGTTTAACCG TATTACTTGG TGATAATGGT GCTGGAAAAT CAACATTACT 84 0 

40 TCGTATGATT GCTGGTATTG AAAAAGCTAA TGATGGAACT ATCAACTATT TCGGCGAAAA 900 

ATGGAATCAA AGACaAATAC aAAATCACAT CGGTTATGTG CCACAAGACA TTGCGTTATT 960 

TGAACACATG ACAGTGGCTG AAAACATTAA ATTTTTTAAA TCACTTTGTA AAAATCCAAT 1020 

TAACGATACA ACTATCAACG AATATTTACA GCAATTAAAC TTTGATGATA CGTCTGCCAA 1080 

AGTATCTACA TTGTCCGGTG GGAATAAACG TAAAATTAAT ATATTAGTAG GTTTACTAGG 1140 

TCAACCTCGA ATTCTCATTT TAGATGAACC GACAGTTGGT ATTGATTTAA AATCTAGACA 1200 

TGACATCCAC CAACTACTTA ACATCATGAA ATCTAAATGT TTAATTATAT TAACTACCCA 1260 

TCATTTAGAT GAAGTTGAAG CACTTGCAGA TGATATCAAG TTAATTGGCC AAGATCCTTT 1320 
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TTTAATGCGC CGATGTTGAT GACCATGTCA ATTTCATCTG CACCATTTTG AATCGCATCT ' 
TCTGTITCAA ATGCTTTCGT CGCAGTTGTC GATGCACCTA ATGGGAATCC TATTACCGTA 
CAAACTAACA CCTCTGAATC AGCTAGTCGC TCTGCTGCAT ATTTAACATG CGTTCGATTC 

ACACATACAG ATTTAAAATG GTATGCTl-TC GCTTCATCGA TGATTTGATC GATTTGCGTA 3 720 

CGTGTTGACT CAGGCTTCAA TAAAGTGTGA TCAATCAATT TTGCACTATT CATTTTCTAT 3780 
CTCCTCCTTT ATGGTTGATT ATAAAAATAC GGTTGTAAAT TAGTTGATTG AGCGTCAGGT 
TCATTTAAAT ATCAGGTTAG ATGTTCGCTT TTTATGTAAC CGCATACATA TACTATTACA 

TTAATTCATT TCCCATAAAC AAACAATACA ATTGAACGTG ATATCTTCAT TATGAACGAT 3 960 

GACTTGACAA CAAGCTAATC AGGATTATAT TTTTATAATT CTTTAATTCT ATAGTACAAA 4 020 

AATTCGCAAA AAAGGGAAAC AAATGTTATC TTAAAATTAT TAATGAATAT TAAGGAGAAG 4 080 

20 ATAACAAATG ACAAAAGGTA CACCACATAT TCAACCAAAT GGAGTAAAAA TTGCTAAAAC 414 0 

AGTATTAATG CCTGGCGATC CGCTACGTGC AAAATATATT GCTCATAATT TTTTAGAAAA 4200 

TGTTGAACAA TTTAAOGATG TACGTAACAT GTTTGGTTAC ACTGGTACAT ATAAAGGTAA 4260 

AGAAGTTTCT GTAATGGGTT CTGGTATGGG TATTCCAAGT ATTGGTATTT ACTCATATGA 4320" 

GTTATACAAC TTCTTTGATG TAGATACAAT CATTCGTATC GGTTCTTGTG GCGCATTACA 43 80. 

AGAAAATGTT AACTTATACG ATGTTATTAT TGCACAAGCT GCATCAACTA ATTCAAATTA 4440 

TGTAGATCAA TACAATATTC CAGGTCATTT CGCGCCTATC GCTGACTTCG AGTTAGTAAC 4 500C 

TAAAGCTAAA AATGTCGCTG ACCAAATCGG TGCTACTACA CACGTAGGTA ACGTATTATC 4560 

TTCTGATACA TTTTACAATG CCGATCCAAC ATTCAATGAT GCTTGGAAAA AAATGGGTAT 4 620. 

TTTAGGTATC GAAATGGAAT CAGCTGGTTT ATATTTAAAT GCGATTCATC CTGGTAAAAA 4680 

AGCACTTGGT ATTTTCACAG TAAGTGATCA TATTTTACGT GACGAAGCTA CTACACCTGA 4740 

40 P^GMlCQTTXZM^ IATICK-ITTK CACAAATGAT GGAAATCGCT TTAGAAATCG CAGAGTAACT 4800 

TATTTAAATT GACTTTAATT GCTCTTTAAC AATGCGATTA AACTCAAAAA GCCAACACAT 4 860 

TCTGGGCGTA TCCCCATTTA TGTGTTGGCT TTTATTTATA TTATTACTTA TCTGTAGATT 4920 

AGCTTAAGTA AGATTTAAAC ATCCAATTAT GTTTATCTAC TGATGTTTGC ATACCTATAA 4 980 

ACATATCTTC TGATACATCA TCGCCAGCAT TACCAGCAAT TTCGATTGCG TTTTCTAATT 5040 

GTTTTGAGAT ATTTGTGAAG TCTTGTGATA ATTCTTCAAC CATTTGTTCT GCAGAGTAAC 5100 

CTTTCGCAGC TTCTTTAACA ATTGATTGCT CTAAGCATTC AGTTAATGTA CCTACAGGGT 5160 

TTCCTCCTAC CGCTAAAATT CTTTCAGCTA ATTCGTCTAC ATATTGGCTT GCTTCATTAT 5220 
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AATTTTTTTA TTTAAAACAA TACCAACCGC TAAATCAATA TCATCCTCTT TTGTTAAACX; 
TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGTTA GAGACTAATT CAGTCACATA 
ACCTGATTTT TTAGCTTTAT ATTCAATTTG ATATTGAGCT TGTGGCAAAC GCTCTGGATG 
GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTTTTGAATT TTTCTAATGC 
TGCACCTGAA TTAATTGCCT CAATTAGCAA CGCTCTCGCT TCTTCAAGCX3 TTTCAGCTTT 
GTTTGCAAGT ACAACCATTT GAGAACCTAA TGTTAATACA AGTTCTGTTA AATCTTTCGG 
ACCTTGTCCT TTCAACGTAT CAATTGCTTC TTGTAACTCA AGCGCATTGC CAATCGCACG 
TCCAAGTGGC TGATTCATAT CAGAAATAAT CGCCATCGTA TTACGTCCCA CATTATTACC 
AATACGTACC ATTGCGTGCG CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC 
GCTACCAGTT TTTACATCTA ATACAATTGC ATCTGCACCA GCAGCAATCT TTTTACTCAT 
AATTGAAGAG GCAATTAATG GTATTGAATT GACAGTACCA GTAACATCCC TTAAGGCATA 
TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGTCCT ACAACTGCCA CTTTATTTTC 
ATTAACCAAT TTCACAAATG TTGCTTCATC TATTTCAACA TGAAAACCAT CAATTGCTTC 
TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG 
AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC 
TGTTGAGTGC TTATCTACTT TGACACCTTT AATATCACTC AAATCTATCA TATCACCAGA 
ATTAACCATA GCCATCGTTA ATGCTGCACG CTCATCATCA TTCATATCTT GGAAATAAAT 
CGCCATTGCT AAACTTGATG CTTGGTAATC AGGAATATCC CCTTTAACAT AGCCGCCAAT 
AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTTTTTCT CAATAATGTC 
TATCATTCTC ATTTTTATCA TCCTTTTCTT AAAAAGCTTA GGACAAAGCA TCTGCGCTTT 
CTCTAGTCCA TTTTTAAAAG CACAAGCGAA AATTATTATA GCAAGCTATC GATTTCATTT 
TTAATATCAC AATTTCAATG CGATGTTATT ATTCTTAAAT AGATTGGTTA TAACGTTAAA 
GTCCCTATTA AATTATCTTA GAATCATCAT GGCATTTATG ATGTCTTAAA GCTGATATCG 
ACATACTTAT ATATGGTTAC GATGTCCCAT GCTTACATAT TTTTATAAAA TTAGTAATCT 
GAATCTGCTT CTAAACCTTG CATAATTTGa ACGCCTGCGC TCGCACCAAT ACGTGTCGCA 
CCTGCTTCAA CCATTTTATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT 
TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTGCAGT CGCAGCGCCA 
CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTTTTGT TAATTCACTC 
GCTTTTACAA TTTCGTCATG GTCCAACAAT ACCGTCTCAA TAATCACTTT TACTGTGTGA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

5 ACCTATAGCr GGCCCGTCAA GTCCTGAATC ATATGTCGCT ATAAACCCAG TTGCATGCCC sO 
AGCAATCTCG GTTTCAATTT CAAAATCATG CTCTTTCAAA CGATCTATTA AAGTTCGAGA 

CGCAAATATT TCITCAITAC CAAGTTCAGG ACGTTCATGA ATTCTATCAC TGATTTCGAT ,eO 

ATAACTATAT TTATTTGTTT CTATATAATC GAGAATTTGT TGlTriTCAC TCATTTTTTA 240 

CTATCrCCCT TTTACCCTAC ACTCATTTTA ATCATCCACA ATATTTTCTT CTTTCAAAAT 300 
GAATTATTAC rTATTCTATC GGTTTTATCT CATGATGTCA TCTAGITTTT CTTTAITTAC 

AAAATITTCT AATAACTAAA aGCCTTTCAT AAATTTATAA AACAG7TTCA AATTCTAAAA 420 
CATAAGCCCT ATTGTTACAA TTTAAGTAIT GTTAGAAACA CAACTTACAA CAAACITAAG 
TTTTATAATG CAACAAATCA TAAGCGTTTT ATAGTITTGA GTAGITAATA GGAGGAAATT 

CAAAATGACA AAAATGAATG TrGAAAGTTT cAATTTAGAT CaTAcTAAGG TCGTTGCCCC 600 

ATTTATTAGA ITAGCGGGAA CGATGGAAGG ATTAAACGGA GATCTCATTC ACAAATACGA 660 

CATTCGTTTC AAACAACCAA ACAAAGAACA TATGGATATG CCOGGACTAC ATTCATTAGA 720 

ACATTTAATG GCTGAAAATA TTAGAAATCA TAGTGACAAA GTTGTTGATT TAAGTCCTAT 780 ' 
GGGTTGCCAA ACTGGTTTCT ATGTATCATT TATTAATCAT GATAATTATG ATGATGTATT 

AAATATTGTT GAAGCAACTT TAAA7.3ATGT GCTAAATGCT ACTCAAGTGC CTGCTTGTAA 900' 

TGAAGTACAA TGTGGCTGGG CAGCAAGTCA TTCATTAGAA GGTCCTAAAA CTATCGCTCA 960 

AGCATTTCTA GACAAACGAA ACGAATGGCA TGATGTTTTC GGTACAGGAA AATAAATCTT 1020. 

AGTCAATCAA GTTAATCAGA AAaGCAGTCG AACAATGATT TTACAATCGC CATTGTCCAA 1080 

CTGCTTTTTA ITATGCTTCA AAGTCAAAAA ATCGAACAAA TCAAAAAGTA AAATCTTTAA 1140 

^ CATTTGTCCG ATTTATTTGA GAACCACTAT AATTTCTTAA TTAGTCCCAT TAACACGAAC 12OO 

TGCATAGGTA ACCTTAAATA TAGTTGCCAT GTTGGCAATT GTTTATCTCC TAAAGGTAAC 1260 

TTTTTAACTG CCATATAGAT ATTAGCTGGG AATACAGCTA GCAAGAATAG ATTGATTCTA 1320 

TTTTTCAAGC ATTGAGATGG TCTTTTAATT AAAAGTGCAA GTCCAAATAA TATCTCAAAG 1380 
ATTCCTGTAA CAAGAACCGC TGTITTTCGA AGTGGCAAAC AriTCGGTAT GATATITCTA 

AATTGTCGTT CTCGTGTAAA ATGCAATACA CCTATTACAC TAAAACCTAT TCCTAATAAA 1500 

TATCTTAGTA TGTTCAATCA GCTTCAACTC CTATTCTGTA ATGATTTTAT GAATTAATCT 1560 
AGGCGATACA ACATGaTCAG CAATTGTTAT GCTTGAATCT AATrrrTTAA CAACATCGTC 



25 



30 



45 



1620 



55 



^SOOCtD: <eP 07865 19A2J_> 



1348 



10 



IS 



20 



EP0 786 519 A2 

ATCAAGGAGC GAAACATCCG AATGAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT 1860 

ATGCTGCTTG GCAGAAATTT GCTAGACCAC GCTTTGaTAA TATGATTAGT TTTAATAGTA 1920 

CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 1980 

TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGTTATA 204 0 

AAAGGAAGTT AAATAGAAAA GATAACACAT AOGATACTGT ACCACTAATC AAAATAAAGT 2100 

CGGTAAAAGA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 2160 

TAGATATGGA TGGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GCACGTTCTG 2220 

GAAGAGGAAC TTTAATTAAA CTTAATTTAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 2280 

CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 2340 

TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 2400 

AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATT AGTAAAATAA 2460 

ACTCACTTGA AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTGCTGTT AAACACTTAT 2 520 

TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTTT AAAGGTATAA 2580 

2S AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 2640 

ATATAGATCA CGACTTCAGA GATGGAGTAT CTGAAGAAAT GATGAAACAT TTGAAAGTAG 2700. 
TCAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 2760 
ATATTAGAAG TATTATGAAA CAACAAGATG CAACAATTTT AGATGGAAAT TTTCAAATTA 2 820 

ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 2 880 

AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 2 94 0 

AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 3000 
CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAATTGA AATGTTAAAT GTAAAAAAGA 3060 
AAGAATTTTT GTCCCTGATT CCTAATGTAA CTTGTAATTA TATTAAAACT AAATTAGAAG 3120 
AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCTTAA TGATTTAAAA GCAGTGTCAC 3180 
CAATTTTAGA TAACCATTTA GATGATATTG TTAAGAACAT GAAGCCTTTG ATTGTACAAA 324 0 

TGAyATwTGA ACCATCACAT TATGaCGATA TGTTTAATTC aAGAAAAGCT TTAACGcCAG 3300 
TGTTCTCAAG CGTTTTATAA AGGTTG 3326 
(2) INFORMATION FOR SEQ ID NO: 443: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5301 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
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CAAGGTACCG GTTTAGGTTT GTTCATTTGT AAAATGATTA TCGAAGAGCA TGGTGGTTCC 
ATAGATGTTA AAAGCGAATT AGGGAAAGGC JKCAACATTTA TTATTAAACT ACCAAAACCA 
^ GAATAAAATT GAATATAGTT ATTTCAGAAC GCATGTTATT GATTAGAGAC TCTAATTTAT 

AGCATGCGTT TTTTGATTGA TGTGGGGAAT TTTGTATGTG GATTAGAACT TAGGGTTTTT 
GCGAATATCA ACTATTAAAT ATATTACTAA TTTATACAAA AATATAAAGT TTGATAAAGT 

70 

TATTTATTTG ATTATAAAAA TAGGGTAAAA TATAGATATA TTGTATTJ\AT TAAATTATTC 
GAGGTGTCAT ATGAAAAAAT TCATTGGATC AGTTTTAGCT ACGACATTAA TTTTAGGGGG 
,5 ATGTTCCACG ATGGAAAATG AATCAAAAAA AGACACAAAA ACAGAAACAA AATCTGTACC 

AGAAGAAATG GAAGCTTCAA AATATGTAGG CCAAGGCTTC CAACCGCCTG CAGAAAAAAA 
TGCGATTGAA TTTGCGAAGA AGCATCGTAA AGAATTTGAA AAAGTAGGTG AACAATTCTT 
20 TAAAGATAAC TTTGGACTAA AAGTTAAAGC TACAAATGTT GTAGGTAAAG ATGATGGTGT 
AGAAGTTTAT GTGCATTGTG AAGATCATGG CATTGTATTT AATGCAAGTC TACCTTTGTA 
CAAAGATGCC ATCCATCAAA AAGGATCAAT GCGCAGTAAT GACAATGGTg ATGATATGAG 780 
TATGATGGTG GGTACAGTGC TGAGTGGCTT TGAATATCGA GCGCAAAAAG AAAAGTATGA 84 0 

TAACTTATAT AAATTCTTCA AAGAAAATGA AAAGAAATAT CAATATACAG GCTTTACAAA 900 
AGAGGCAATT AACAAGACAC AAAATGTCGG ATATAAAAAT GAATATTTTT ATATTACATA 960 
CTCTTCTAGA AGTTTAAAAG AATATCGAAA GTATTATGAA CCACTGATTC GAAAAAATGA 102 0 

TAAAGAATTT AAAGAAGGAA TGGAACGAGC AAGAAAAGAA GTGAATTACG CTGCAAATAC 1080 
3S AGATGCTGTT GCTACACTTT TTTCTACTAA GAAAAACTTT ACTAAAGACA ATACAGTAGA 114 0 

TGATGTAATC GAACTAAGTG ATAAATTATA TAATTTAAAA AATAAACCAG ATAAATCTAC 1200 
AATCACAATA CAAATAGGGA AACCCACTAT TAATACTAAG AAAGCCTTTT ATGATGATAA 1260 

40 

TCGTCCAATA GAATATGGGG TGCACAGTAA AGATGAATAA AATTAATGAT AGGGATTTAA 1320 
CAGAATTAAG TAGTTACTGG GTTTATCAAA ATATTGATAT AAAAAAAGAA TTTAAAGTTA 13 8 0 

ATGGAAAAAG GTTTAAACAA GTAGACAGTT ATAATGATGA TAAGAATAGT AATTTGAATG 144 0 

GTGCTGCTGA TATTAAAATA TATGAGTTAT TAGATGATAA AAGTAAACCA ACTGGTCAAC 1500 
AGACAATAAT TTATCAAGGA ACATCTAATG AGGCAATTAA TCCAAATAAT CCATTAAAAT 1560 
SO CATCGGGGTT TGGAGATGAT TGGCTCCAAA ATGCTAAATT AATGAATAAT GATAATGAAA 1620 

GCACAGATTA TTTAAAGCAA ACAGATCAAT TATCAAATCA ATATAAAATA AAGTTAGAAG 16 8 0 

ATGCAGATAG ATTATCAAAT AGTGATTTTT TAAAAAAATA TAGAATGGAA TCAAGTAACT 174 0 
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7UUVTTCTTTA TAGACTGATG CAAATCTAAC ATATGAAACT TGATCAACAT GCATTAACAA 1200 

GTTCATAACX3 TGTTCACCTA TATCTCGTGA AGACACTTCC GTATGACCTT CATCTCGTAA 1260 

5 TTGCCATTCA ACCTTGTTAG TTATGTCTTC AAGTTGTTGA TATCTAACTG GTCGTTTCTC 1320 

ACAAGAACGC ACAAGTCCAT TAAGTATCTT TTCTCTTGAA AACTGCTCTC TTGTGCCATC 13 80 

TTTTTTCACA ACTATAAGCT GACTAACTTC GATATGTTCA AATGTAGTGA AACGTGTTCC 144 0 

10 

ACAATTTTCA CATTCTCTTC GTCTTCGAAT GGCATTTAAT TCATCGGCAT GCCTTGAATC 1500 

TACAACTTTA GATTGTGTAG AATTACATTT CX3GGCATTTC ATTACATCAC CCTCTTTATT 156 0 

TTGATTATGC CTAATTATAC TATAAATCTA GAGATGAAAA AAGAATCCCT CAATTTAATT 1620 

IS 

CATTTAACCA AATAATGAAA CAATAAAAAA CATTATATCG TTACTTATTA AGTAATTTGC 1680 

ATGACAATAT TATTGTATTA PiPiMiTPJKAPA ACCTAACTCC GAAGTCAGAG TTAGGCTATA 174 0 

20 AATTAATTGT ATTAACTTGC ACTTACAGTT TCTTTTGATG TCAAAAGTGC TCCAATTTGC 1800 

TCAGCAACAT CTACAACTCT ATTTGAATAA CCCCATTCAT TATCATACCA AGCAATAACT 1860 

TTTACTTTAT TCCCTGACAT GACCATTGTT GATTTTGCAT CAATAATAGC TGAATTTGGA 1920 

25 TTAGTATTAA AATCAACAGA CACTAGTGGT TGATGTTCGA CTTCTATGAT ACCTTCTAAA 1980 

CCTGCATTTT CAAAAGCTTG GTTTACTTCT TCTGCAGTTA CTTCTTTTTC TAAATCAAC7V 204 0 

ACTAAATCAA CGAGCGATAC ATTCTTTGTT GGTACACGTA ATGCCATGCC GTGTAATTTA 2100 

30 

CCTTCTAATT CTGGTAATAC TTCTTTTAAA GCTTTCGCCG CACCAGTAGA AGTAGGAATA 2160 

ATGCTTTCAT TACATGAACG TGCACGTCTT AAATCTTTAT GTGGATTATC AATATTTTTT 2220 

TGGTCATTTG TAATAGCGTG AACAGTAGTC ATTAAACCAT TAACTATTCC AAACTGATTA 2280 

35 

TTTAAAACTT TTGCAACTGG ACCAATGCAA TTAGTAGTAC ATGAAGCATT ACTAAAAATG 234 0 

TCAAATGCTT CTATATCTAA TTGGTTATCA TTTACGCCTT TAACTACCAT TTGAACATGT 2400 

CCACCTTTTG nAGGACCAGT TAACAAAAsT TTTtTGGCAC CTGCTTTAAT ATGTGCGATG 2460 

40 

GCTTTATCAC CATGaTTAAA TTTACcAGTT GCATCTATAG CAATATCGAT ATCTAATTCT 2520 

TTCCATGGCA AGTTTTCAGG ATTGCGATCA GCAACCAATT TAATTTTATG ATCT 2574 
45 (2) INFORMATION FOR SEQ ID NO: 442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 
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ATTGGACCGT GGGTAACAAA TACTAATTGA AGTCGTGATA ACX3TAAAAAT TAAAAATACT 
ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 
GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 
TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 
ACAACACTTC CAGCACAAAA AA 
(2) INFORMATION FOR SEQ ID NO: 441: 



480 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2574 base pairs 
15 TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<5Ci) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

ATCTTCTTTT AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 60 

ACGTTCAATT AATTCTTCAA CCATrGTCTT TTGCGATATA GTTGGTrCTG ATTCAGACCA 120 

AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 180 

TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAGAT-TTCAC 240 

CTGTAATTTA GGCATTTGAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 300 

CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360 

GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ATTAAACGCT TTGCATCTTT 420 

GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 4 80 

AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 54 0 

ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TTCTTACCK5 600 

TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660 

ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GGATCATTAA AAAATAAATG 720 

TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATGTTTGACA 780 

AACGCCTTTA ACAATCCAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 840 

TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 900 

TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 960 

GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCATAACT 1020 

TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 1080 
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(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 

ATGTnGGCGG ACGTAAAGGA CATGTTTATA CTGATGATCG AGCATTAGAT ATTGATATCG 60 

TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA ACAATTATTT GCAGCAGGTT 120 

ATGCATCTTG CTTCAACGGT GCTTTCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 180 

CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAGACTCA GAAAGTCCTA 24 0 

AATTAAGTGT TTCAATTGAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCTGAAA 300 

AATATTTACA AATGGCTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA GGAAATATTA 360 

ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 420 

2^ ATTAATAAAA TTCACATAAA ATTCAAATTt stCrAcCAAA AATTTTTGGT TGGyTATTTT 4 80 

TTCTATTCGT GATTGAAATT TCTGGCAATA TTTAACTGAA AATGATTGTA CCTTAGTCAT 54 0 

CATAAATGTG ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 600 
GTCCAGTG 

<2) INFORMATION FOR SEQ ID NO: 44 0: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 
CCTTTCAAtT TCTCCCAAgA TTTACGCATG TCTGACCAAA TGTCAATACA ATACCTGAAT 6 0 

4^ AATGATTTCT GGAGTAATTG CGAAAAATAT TTAAAAGTTG CTATAGATCA ATTTTCAAAT 120 

TATAGTATCT CATCTCAAGT TTCTAACTAT CATTTCACAG TATTACTTGG AGATCGCCAA 18 0 

AAACCACTTA TGTATCTAAA TAAAAATCGC GGTGGTGATG GTGGCATACC AGGTTATATT 240 
ATGATTTATT TAGTGCCGAG TACAAGTACA ATTAATTCTA TGAAAAGCTT AATTGCACAT 300 
GAAGTAAATC ATAATATGCG CTATCAATAT ATTGATTGGG ATGGCGGAAG TTTGATTGAA 360 
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(2) INFORMATION FOR SEQ ID NO : 43 8: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(XI ) 


SEQUENCE DESCRIPTION: 


SEQ ID NO; 


438 : 






ACAAAATCAA 


TCAGCAAAAT 


txA 1 1 CjO ATTA 


AGACGCAAAT 


TGAGCGTTCA 


ATGGAAGGCG 


60 


AAACAGTTGG 


X X rvV X ^A J. 


v-AAAATATAG 


AAATATATAG 


TGAAcGtTgA GATTTATACC 


120 


ATACACTCGT 




t-AAGAATTGC 


ATAAGTTGCG 


ACTTAAAACT 


CAAAACTTAA 


180 


CCAATGAAAA 




AAT G ATGTGA 


AAGTTAAAAA 


GATTATTGAA 


GATGAACGTC 


240 


AAAGACTAGC 


ACGAGAACTT 


CACGATTCTG 


TTAGTCAGCA 


ACTTTTTG CG 


G CTr»A.GTATGA 


3 CO 


TG CTATCTG C 


TATCAAAGAA 


ACGAAtTAGA 


ACCACCATTA 


GACCAACAAA 


TTCCTATTTT 


360 


AGAGAAAATG 


GTTCAAGATT 


CGCAGTTAGA 


AATGCGTGCT 


TTGCTGTTAC 


ATTTAAGACC 


420 


GCTTGGTTTA 


AAAGACAAAT 


CTTTAGGTGA 


GGGTATTAAA 


GATTTAGTTA 


TTGATTTACA 


480 


AAAAAAAGTG 


CCAATGAAAG 


TTGTGCATGA 


AATACAAGAT 


TTTAAAGTGC 


CTAAAGGTAT 


540 


J. VjAAviATCAT 


TTGTTCAGAA 


TTACACAGGA 


AGCAATTTCG 


AATACATTGC 


GTCATTCAAA 


600 


v-L>0 i ACAAAA 


GTGACAGTAG 


AATTGTTTAA 


TAAAGACGAT 


TATTTATTGT 


TGAGAATTCA 


660 




AAAGGTTTTA 


ATGTTGATGA 


AAAATTAGAA 


CAAAGTTATG 


GACTTAAAAA 


720 




AGAGCTTTGG 


AAATTGGTGC 


AACGTTCCAT 


ATTGTATCAT 


TGCCAGATTC 


780 


1 At-ACGT 


ATCGAGGTGA AAGCACCTTT 


AAATAAGGAG 


GATTCGTATG 


ACGATTAAAG 


840 


TATTGTTTGT 


GGATGATCAT 


GAAATGGTAC 


GTATAGGAAT 


TTCAAGTTAT 


CTATCAACGC 


900 


AAAGTGATAT 


TGAAGTAGTT 


GGTGAAGGCG 


CTTCTGGTAA 


AGAAGCAATT 


GCCAAAGCCC 


960 


ATGAGTTGAA 


GCCAGATTTA 


ATTTTAATGG 


ATTTACTTAT 


GGATGACATG 


GATGGTGTAG 


1020 


AAGCGACGAC 


TCAGATTAAA 


AAAGATTTAC 


CGCAAATTJ\A 


AGTATTAATG 


TTAACTAGTT 


1080 


TTATTGAAGA 


TAAAGAGGTA 


TATCGTGCAT 


TAGATGCAGG 


TGTCGATAGT 


TACATTTTJ\A 


1140 


AAACAACAAG 


TGCAAAAGAT 


ATCGCCGATG 


CAGTTCGTAA 


AacTTCTAGA 


GGAGAATCTG 


1200 


TTTTTGAACC 


GGAAGTTTTA 


GTGAAAATGC 


GTAACCGTAT 


GAAAAAGCGC 


GCAGAGTTAT 


1260 


ATGAAATGCT 


TACAGAACGA 


GAAATGGAAA 


TATTATTATT 


GATTGCGAAA 


GGTTACTCAA 


1320 


ATCAAGAAAT 


TGCTAGTGCA 


TCGCATATTA 


CTATTAAAAC 


GGTTAAGACA 


CATGTGAGTA 


1380 
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(A) LENGTH: 1553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

^0 ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60 

PiMATGPJKPJ^ GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120 

TAATTGAGCT ATGCTTATTA TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 180 

ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAGCCCAA AATACTTGTT 24 0 

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATAAAC TTGAATCTGA AAGTAATTAC 300 

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 360 

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420 

GAGTtCACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCT TATTAACCGT AATAACGATT 480 

ACCTTGAAGA CGTATTTTTC TTATTATGTT GATTTTTCTT TAGGTGTTAA AGGTTTAGTA 54 0 

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 6 00 

CTATTCTTTA AAGGCAAAAA AGCATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 660 

30 TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 72 0 

ACTTTAAACC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 780 

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT TATTCATTTT AATATTTAAA 84 0 

ACAAAATGGT TAGACACAAA AGCATTTAGT AAGAAATTTG TTCCTGTCGT AATGGCGGCT 900 

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 960 

ACACGTACAT TTGACCATAA ATATTTAGTG AAATATTTAG GACCTTATAA CTTTACAGTA 1020 

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 108 0 

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACmG AGCCTAACCC rGAwTATTAT 114 0 

GGGGTGGcAA rGAAGAAAAA TATTATTArG ATTCATTTAG AAAGTTTCCA AACCTTCTTA 1200 

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 1260 

AAAGAGCAAT TCACATACTT CCCTAACTTT TtCCATCAAA CAGGTCAAGG TAAAACATCT 132 0 

SO GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 1380 

TtaaaaGGAG ATAATACGTA TCAGTCATTA CCAGCAATTT TAGATCAAAA GCAAGGCTAC 1440 

AAATCTGATG TCATGCACGG TGACTATAAA ACATTCTGGA ACAGAGACCA AGTATATAAA 1500 
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GTATAAATAA TTTAGTCATT TGCAAGACCT CGCTCTACCA TATATTGTCT AATTGTTTCC 1680 

ATATCTTTAT CGCCACGTCC AGAAATAGTT ACTACAATAA TATCTTCTTT CGACATCGTA 174 0 

GGCGCTAGTC TTTCAACATA ACTCAGTGCA TGTGCACTTT CAATTGCAGG TATAATACCT 1800 

TCATGTTTTG TAAAGTTGAT TAAAGCATTC ATTGCTTGTG TATCACTAGC ATTTTCAAAA I860 

GTTACTCTAC CAATGTCGTG GTAATAAGAA TGTTCTGGTC CAATACCAGG ATAATCAAGT 1920 

CCTGCTGAAA TAGAATGTGC TAGTTGCACT TGCCCATCTT CATCTTGAAT TAAATACATT 1980 

TTAGTACCAT GTAATACGCC AGGTGATCCT TTGCCAATTG CAAGTGCATG TTTATCAGTA 204 0 

TCATCGCCTT GACCTGCGGC TTCAACACCG TATAATGCAA CATCATCTTT AATAAATGGA 2100 

TAAAATGTAC CGATTGCATT TGAGCCACCA CCGATACATG CTACAATTGC ATCCGGAAGT 2160 

CGACCTTCTT TCTTCAATAT CTGTGATTTT ATTTCTTTAC CAATCAGACT CTGAAAATCT 2220 



TCTACATGAC TTACCCAATA TTGCAATGCT TTATTAACTG CATCCGATAA AGTCCCTTGA 234 0 

CCATCTTCAA CTGCCACAAC CTTTGCACCA AGTAATTCCA TTCTATIATAC ATTAAGTTGT 24 00 

TGTCTTTTAA TATCTTCACT TCCCATAAAG ACAACAAGTT CCATATCAAA TAATGCAGCA 24 60 

ACCGTAGCAC TAGCTACACC ATGTTGACCC GCACCAGTTT CAGCAACAAG CTTCTTCTTG 2520 

CCCATTCTTT TAGCAAGCAA CGCTTGACCT AACGCATTAT TAATTTTATG GGCGCCTGTA 2580 

TGATTTAGAT CCTCTCGTTT CAAATATATT TTAGCGCCAC CTAGGCTTTC AGTATATGAT 264 0 

GCAGCATATG TAAGTGGTGT CGCGCGTCCT ACATACTCTG ATAAATAGTA TTCCAGTTCT 2700 

CTTTGAAACT CTGGGTCTGC TTTTGCCTCT TTATAAGCTT TTTTCAACTC AATAATTGCT 2760 

GGCATTAATG TTTCTGGAAC ATATTGCCCT CCATATTCAC CAAAGAAACC TAATTCATCT 2820 

GCTTCTGTTT GTATTTGTTT ATTCATTGTC TCTATCTCCT TTCACAATAT TTACAATTGC 2880 

40 TGTCATTTTT TCTATATCTT TTCGCCCATT TACTTCTATA CCTGATGCAA GATCATAACC 2 940 

TTGATGTGAT AATTTAAGTT GATTAACTGT TTGAATATTT TCAGAGTTAA TGCCTCCTGC 3000 

TATCAAATAA GGTATGTCTT TTATGTGCTT CAAAATAGTC CAGTCATATG TTTGACCGGT 3060 

ACCACCATAC GACACTGAGG GTGTGTCGAT AATAAATAAA TCTACGAACC CTTTATATTT 3120 

ATTTATGTTT TGGATTATGT TTTCATCTnG CAGtAAAGCT TTAGTGATTT TAATGCTTGA 3180 

ATATkTCTTT TTAATTTCCT GTAtAAAAtC AATAGATTCt GtGTAACTGT ATTGTGkTAA 324 0 

wTGaCGwATg CtTAAwACgT GTGCCAATGG T 3271 
(2) INFORMATION FOR SEQ ID NO: 43 7: 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 

TAAAAACTTA CTTTAACACC ATTCCTTTTT AACTTTTTTC GTGTTTCnTT TnCTTAAGTC 60 

10 CATCCATATT TTTAATGATG TCATCTGCTG TTTTATCTTT TAAATCTAAC ACTGaGTGAT 120 

AmCGAATTTG TAGCACAGGA TCAAATCCTT TATGGAATCC AGTATGTTCA AATCCTAAGT 180 

TACTCATTTT ATCAAAGAAC CAATCATTAC CAGCATTACC TGTAATCTCG CCATCATGAT 24 0 

TCAAGTATTG ATATGGTAAA TATGGATCGA TATGTAGGTA TAGACAACGA TGTTTTTTAA 3 00 

CATATTTTGA TAATTCATTA AAGAAAAAGT GTACGAGTTC TTGATTTTCA TAATCAATCA 3 60 

CTGGACCGCG ATTTGAATAA AAATACTTGA ACACTTTCAT AACAGGTACA GCAGTAAGtA 420 

AGCAAGCTGC AATGACCTCG TTATTATTGT TTTTTATTCC CACTAAATGT GTTTCATAAC 480 

CTTCAGCAAG CTTTAACTCA TAGTGGCCAA CAGTTTGCGT GaAATGACTG TATGGCATGC 54 0 

TATCTGTAAa GGCACCAAAC TCTTTAGCTG TTAAATTTGT AAACTTCATT ATCATTACTC 600 

CTATTTGTCT CTCGTTAATT AATTTCATTT CCGTATTTGc AGTTTTTCTA TTTCCCCTCT 66 0 

GCAAATGgCA AAAATAATAA ATCTAATCTA AATAAGTATA CAATAGTTAA TGTTAAAACT 720 

30 AAAACATAAA CGCTTTAATT GCGTATACTT TTATAGTAAT ATTTAGATTT TTGAATACAA 7 80 

TTTCAAAAAA AGTAATATGA ACGTTTGGGT TTGCTCATAT TACTTTTTTT GAAATTGTAT 84 0 

TCAATTTTAT AATTCACCGT TTTTCACTTT TTCAAACAGT ATTCGCCTAA TTTTTTTAAA 900 

TCAAGTAAAC TTAATTATTC AATGTTTGTT GGATAGATTG TAAATATTTA ATGATTTCCT 960 

CACGCGTGTT AGATTTAAAT CGCTTAACGA TTTCGCTACC AATGACAATG CCATCTGCAA 1020 

CCTCTTTTAT ATCTGCAACA TGTTGTGGTG TTCTTATACC AAATCCTGCG ACAACTGGCA 1080 

CATTGGCTAT CGCTTTAATT GACTCAATTT TTCGTTTTAA TTCTGGATGA AACGCACCGT 114 0 

TTTGCCCTGT TGTCGCATTC ATCGTCACAG TATAAATAAA GCCTTCCGCA TGGGATACGA 1200 

^5 TATCTTTTAT ACGTTTGTCA TCAGTAGTCA TCGCAACTAA CGATATGATT TTGACGCCAT 1260 

AGTGACTAAA TTGTTGTTTT AAACGCTGCG ATAATTCATA TGGTAAATCA GGAATAATTA 1320 

AGCCGTAGAC ACCAGTATCT CGACATTTTT CAAT^AAACGC TTGTTCTCCA TAATGACAAA 13 80 

TAATATTATA ATACGTCATT AATACATAGT TACACTTAAT TTGATCACCA TGTTTTTCTA 144 0 

ATTGATTGAA AATATAATCT ATCGTGATGC CTTGTTTAAT CGCTTGTTGA CCTGCTTCCA 1500 

TGATAACTGG ACCATCAGCA ACCGGATCAG AGAAAGGTAC TCCAATTTCA ATTATATCTG 1560 
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TCGAATGAAA TCACTACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 12 00 

AATTTCAAAA CTAGTTCX3AT ATTTAAAATG TATTATGGAT GGnTAAAGTT TGTATCGCAT 1260 
TATCGCX5AAG TTGllATAAAT ATAT 
(2) INFORMATION FOR SEQ ID NO: 43 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TaATCGATTT 60 

AGATGGCACA ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 12 0 

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACACCTGA 180 

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 24 0 

GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAGCAT CAGTATATAT 300 

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA TTAAAAATGA 360 

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 420 

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 4 80 

AATTCCTAAA GAGCGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 540 

3S ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA TGGTTAAAGC 600 

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC 660 

CGATATTATG TCTGGTATTA ACGTAGGTAT GGATACGATT CATGTACAAA CAGGTGTATC 720 

TACGTTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 7 80 

TGAAGCAATA GCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG GTGAATTGAT 840 

TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA ATTAAGTAAA 900 

CTTGGTGACG TTGTTATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA TCAATTTGTG 960 

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC GATGCAGAAA 1020 

TTTTAGCGCA ATCACCAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA TA 1072 

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3271 base pairs 
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TTTCTTTAGA TTCACTACTT TTTTATTACC ATCATTCAAA GTAAGCGTAT AAGTTGCTGT 4500 

TTGGGCATTA TTAATTTTTT CTGTTGTAAC ACC7VCGTTGA GAAGCTAATT CATTTTTTAC 45€0 

TTTACTGTCA ATTTCTTGAT AAAGAACATT TTTATTTTCT GGGAAGATAA AGTAAGTTCG 4620 

ATGTAATGCA GTAATACCAT CTACTGAAAT TGTGTAAGGA ACAGTGTGAT AACCATCCAC 46 80 

AGTCATTTCT TTATAGCCGT TATTACTATC TGCAGATGCT TCGTGACTCG GTA 4733 



10 

(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 
^5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 
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AATAAATGGT 


AATTTTATAT 


CATCAACGGC 


TACAAATGGG 


GCAGTTTCAT 


ATATAAATAA 


60 


ACACATCTAC 


TCAACTGnTC 


TTAATTAATT 


AATAAATGAT 


TTACCAGATG 


ACATATAACA 


120 


GCCATTATTA 


GnTTAATTCT 


AATAGTTTAT 


TTAAATTTTC 


TTCGGTTGTC 


GCCCAACTGG 


180 


TTGCGAATCT 


AACAACACGA 


TGTTGATCAT 


CGTATTTTTC 


CCAAACAGCA 


AATTTAACTT 


240 


TTTGTTCTAA 


CTCTG CT ATT 


TTCTCGTTAC 


TTAAAATAAA 


AAATTGTTGA 


TTGGTTGGAG 


300 


AATCAAAGTA 


AAGACGATAG 


CCTTTATTTT 


TAAACCCGTC 


TTTCATCTTA 


TTTGCCATTT 


360 


CGATAGCATG 


TCTGCTTATA 


TTAAAATATA 


AATTGTCOGT 


AAATAATTCT 


AAAAATTGTA 


420 


TGCCTGTTAA 


CCGTCCTTTT 


GCTAAAAGGG 


CACCGTGGAT 


GCTTGATTCG 


AGTGGTAAAT 


480 


TGTTTCGGTT 


CATTATTTTT 


CGTAAAAACA 


ATGGCTTCCC 


CGCATAATGC 


ACCTATCTkC 


540 


GTACCACCTA 


TATAAAATAC 


ATCACAATAT 


TTAGCGrTGt 


CTTTAATAGT 


CATATCTGAT 


600 


TGGTCACTCA 


TCAATCCATA 


CCCTAATCGT 


GCACCATCCA 


TAAATAATGG 


AAGCTGATAT 


660 


TGCTTACATA 


CTTTGCATAA 


CTCTTCCAAT 


TCTGaTTTAG 


AGTATAATGT 


GCCATATTCT 


720 


GTAGGATGAG 


AAATATATAC 


CATTCCTGGG 


AATACCATAT 


GGTCCrmT 


AAAATCACTT 


780 


TTAAATGTCT 


CCATGTAAGT 


TTCAACATCT 


GAAGCACTAA 


CTTTTCCTTC 


CTTAGAGGGT 


840 


ATAGTAATTA 


CTTTATGTCC 


ACTATATTCA 


ATTGCACCGC 


CCTCATGCAC 


AGCAACATGA 


900 


CCAGTGTCTG 


CTGAAATGAC 


CCCTTCGTAA 


CTTTCTAACA 


TTGAATTAAT 


AACAACCTGA 


960 


TTGGTTTGTG 


TTCCACCTaC 


TAAAAAACGA 


ATTGTAGCAT 


TTGGaCAGTC 


AATTGTATCT 


1020 


TTAATCTTTT 


CAATTGCCTG 


AGCTGTGAAT 


TGaTCAAAGC 


CATATCCCGA 


AGCTTGTACA 


1080 
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CAATTAAGAA CACATATTGA 


TGAAGAAACT 


GGTATTAAAG 




2700 




CTACATCCAG 


TGGAAATTAT 


TGAATCTATG 


CAAAAGGTAT 




2760 


5 


ACAGTTGATG 


TTGGAAGTCA 


CTATATTTGG 


ATGGCACGTA 


ATTTCAO A T^Ci TTA r' A A TfT^ a 


2B20 




AGACATTTAT 


TATTTAGCAA 


TGGTATGCAA ACGCTTGGTG 




2880 


10 


TCAGCTGCAC 


TTGTGCGCCC 


TAATACGCAA 


GTTGTGTCCX3 


npGfTrf^^ c^rx a tcit* ttit* ^^t^t* i* 

•i A VJV- X \J%9V»ULrV X \J\3 X ViTVjV^ XXX 


2940 


TTATTTTCAT 


CACAAGATTT 


AGAAACGGCC 


GTACGTAAAA 




3 000 




ATTTGGAATG 


ATGGAAAATA 


TAACATGGTT 


GAATTCCAAG 


AAGAAATr? A A AT AT* A a Ar«r<T 


3060 


IS 


TCGTCAGGTG 


TAGACTTCGG 


TCCTGTAGAT 


TTTGTAAAAT 


ATfiPAOAATP A 'VT^*n/^ r*r^ r^r* 


3120 




AAAGGTTTAC 


GAGTTACTAA 


TCAAGAAGAA 


TTAGAAGCGG 


CAATTAAAHA r;/'!!P/-^A'T»r«n ?v 


3180 




ACAGATGGTC 


CAGTATTAAT 


TGATATACCT 


GTAAATTACA 


AAGAT*AAT*AT r*A A A/''*T"i*'i*/"<a 


3240 


20 


ACAAATATGT 


TACCTGACGT 


ATTTAACTAA 


AATAAAGATA 


AAnrnTTAAAr; Ar^Ar^lv^y^Tv 


3 3-Q Q 




GATTTTATGA 


CTAATGTTTT 


ATACCAACAT 


GGTACATTAG 


GCLAPATTAAT f'ViPAr^r* attr 


3360 


25 


TTAAAAGGAA 


CTGCATCAAT 


AAATGAATTA 


TTGCAACATG 


^ VJA^ X X AVjtVj 1 a 1 i. J\sSA 


3420 


CTAACAGGTT 


CAAACGGTGA 


GGTAATCTTT 


TTAGATGGAA 


'v*^^ i J. V-A i Ij V_-AAA aTjAA 


3480 




CATAAAGAAT 


TTGTAGAATT 


AAAAGGTGAC 


GAGTTAACAC 


C'*AT'ATY2PA AP Tr"T*A A/Ti\ >^ 
^^-f* v>\-AAU lolAAt-lAAA 


3 54 0 


30 


TTTGTAGCAG 


ATACAAGCTA 


TGAAACGAAA 


GATAAATCTT 


C!AGAAf5PAf^T TmrspAr* a a 


3600 




ATTAAGGAAA 


AGATGTTGAG 


TGAAAATTTA 


TTTTCAGCAG 


TAAAAATT^P Arsnp'PTA'PT*^ 

A rw\r\n A A A V- rWjVjV,^ X A A i, i a 


3 660 




AAAAAAATGC 


ATGTACGTAT 


GATGCCGGCT 


CAAGAACCAC 


CTTATAPAPfi TT^AATPPAT 


3 720 


35 


TCAGCTAGAA 


GACAACCTGA 


ACAAACTGAA 


ACGTATGTCA 


• ''^vjw A A Vi^nVi? X v»o X X VjVj X X X v. 


O "7 O 

3 7 BO 




TTTACACCAG 


AATTATTCCA 


TGGTATCXK3A 


TCAGCAGGAT 


TTCATXSTAPA CTTTCZ Pf2 A A T 


J o40 




GATGATCGTA 


ACTTTGGTGG 


ACATGTCTTA 


GATTTTGAAG 


TAGAAGATGT TAAART AO A A 


Q 


40 


ATCCAAAATA 


TAGAAACATT 


TGAACAGCAT 


TTTCCAATTC 


AAGATAAAGA 'PTTPAPTAAA 






GCAAATATTG 


ACTATAAAGA 


TATTGCAGAC 


GAAATTAGAG 


AAGCTGAATA ATrSAATPPAP 




45 


AAATATAATG 


ACGGTTTATG 


AAAATTGACT 


TCATAATGCG 


CGATTTAGAA ATOATAPTT^T 

^ VJ-f* AAA i^WvV X VJLTV X AV7 XXX 


^ ft O A 


GTAAATATGA 


TTAACCATGA 


CTACAATAGA 


ACAAATATAT 


'1*'1'ATAAT*TAP rJT'PT'A Ar^fa tv 
A A M A nrv X X O X ^ X AAv7 X AA 


414 0 




TAAAATAAAT 


CCCTTCACTA 


TTAGCAGTAG 


TGAGGGGATT 


TATTAGGTTC CAGATATTTG 


4200 


SO 


AGATTTGCTG 


TTATGTTTAG 


ATTATAAATT 


GTGGTACACA 


CTCATATAAA ATTTACTATT 


4260 




GTATAGGCCA 


ATCTGTTACT 


ACGAGAAGCA 


AACAACAATA 


ATTTACAAGT TCAATAACTA 


4320 




AAAAGACAAA 


CGCCAATTTT 


TCAGCGCTTG 


CCCTATAAAA 


CTATTTTCAA ATTATTATTT 


4380 
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TTAGCTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATTCAAGGG AAAGAGCTAT 900 

CX3GATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC AAAATAAGGA 960 

5 GTGACTTAAT TGAAAGATTC TATATTTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 1020 

GTTGsGATGC TGGTGTTTCT ACTTTTTAGG TTTTATATTA AaACAGATAA CTTTTCTATA 1080 

TATTTAATGA GTATCTTCTT 7VATTTGTTTA GGAACTGCTT CTATCATTTA TAACTATAAA 1140 

ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 1200 

GGCTTTTTAT ATCTCCATAA AACGACAAAT TCAAGCCCGA CATAAAACAG CATTTTCAGC 1260 

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 1320 

IS 

TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACGCA CAAAATCTAT 1380 

. TCATGCATTT TTGGAATACT TAGTATTACA AATAACGATT TTTATTCATC TAACAAAGGT 144 0 

20 TATATAATGT ACTGAAGGCA ATTTTTATGT ATTACAAATC TAATCGTACA TGTAAAATTT 150 0 

TGATAAACAT CATTAATTTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 1560 

TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 1620 

TACACTGCAG CCGATATGGT TATTGATACT TTGAAAAATA ATGGGGTAGA ATATGTTTTT 1680 

GGTATTCCGG GTGCAAAGAT TGACTATCTA TTTAATGCTT TAATTGATGA TGGTCCTGAA 174 0 

CTTATTGTCA CTCGTCATGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 1800 

30 

ACAGGCAAAC CGGGTGTAGT ACTTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 1860 

GGATTATTAA CCGCAACATC TGAAGGGGAT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 1920 

CGnAnATGAT TTATTACGAT TAACGCATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 1980 

35 

TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATGACAAA 2040 

TGCAATTCGA ATTGCTACTT CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 2100 

40 CGTTATTTCT TCACCAGTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 2160 

AGTACCGAGT GAACAAGATA TTAATGATGT CATTGAAGCG ATTAAAAATG CATCATTTCC 2220 

TGTTTTATTA GCTGGTATGA GAAGTTCAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 2280 

45 

AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 2340 

TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 2400 

ATTATTACGT AAAAGTGATT TAGTTGTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 24 60 

SO 

TAGTAACTGG AATAAAGAAT TAGAAACACA AATTATCAAT ATTGACGAGT TCAAGCTGAA 2520 

ATTACTAATT ATATGCAACC GAAAAAAGAG TTGATTGGTA ATATTGCTAA AACGATTGAA 2580 

55 
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TCATTCTCAT CATATTCTAG 


GTTGTTTTTA 


ACAAACTAAA 


TATAGTGAAT 


GCAAATCAAC 


1800 




TATTATTTAA 


ATTATGAATT 


ATTTTAATTC 


TTTCTTCTAC 


GAGCCAATAA 


CATTAATCCA 


1860 


5 


GCAATTCCAA 


TTATACTACT 


AAAGATCAAA 


CCTTTTTGCG 


TGCTTTCTAA 


ACCTGTTTTT 


1920 




GGTAATTCTG 


CTCGTTTTTT 


CTCTTGATTA 


GCTACTGATT 


CTTTAGCAAT 


TTTAGATTTT 


1980 


10 


TTAACTTTAT 


CATTTTTATC 


CATTGAATGA 


ACTGGGCCAT 


TTGGTTTTGC 


TCTGTCTTTC 


2040 


GATAATCCTG 


GATTGTTAGG 


ATTTACTGGG 


CCACTTGGAT 


GAGTTGGTCT 


GCTCGGCTTC 


2100 




TCTGGGTTTT 


CAGGTCCTTT 


TGGATCTTTT 


GGTTTCTCTC 


CACCGAACTC 


TACAATCTTA 


2160 


IS 


TCTACTGGTT 


GTTTtGTGAT 


CTCTTCTGTT 


GGTTGACCCT 


CGCCAACTTT 


TTCACCTGTT 


2220 




AATGGGkTCA 


CTGTGAwTGG 


TGkTGTGAwT 


GTCyTACTTC 


CTGGkTGTCC 


TTCTTGTTTC 


2280 




ACTCGCTCTT 


CACCAGGTTG 


TAAT 








2304 


20 


(2) INFORMATION FOR SEQ ID NO: 433: 








2S 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4733 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








30 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 






GTCTTAATAT 


TTTACAAGAT 


ACAGCCTGAC 


AATATTTATA 


TTACAAACCA 


ATATGGTAAA 


60 




GAGGCTGTGT 


CTCGATTAAA 


AAGAATTTTA 


GAGCAAAGTA 


TACCTAATAA 


AGAATATATT 


120' 


35 


GATTTATCAG 


AGATATCATT 


AATTACATTT 


TTTAAACCAG 


AATATAATAA 


ACAACATGTG 


180 




AATGAGGAGT 


TTAATTCTTT 


ATCTTCTTCG 


AAAATAAAGA 


ATTTAGCCAA AGAAAATGAT 


240 




GGTATTATTA 


TTTATTTAAA 


TTTTAAGGAT 


GTTAATTATC 


AACTGaTAaG 


TGAAGGCAAT 


300 


40 




CTGCAAAACC 


ATATTTGCAT 


TGTGTGTTCA 


ATAAAGATAT 


TTTAAATATG 


360 




GATAAAGTTG 


ATATAGAAAA 


TTTTTTCAAA 


AGTATAGAAG 


TCAAATACAG 


TAAATAAGAT 


420 


HO 


CAATTATTGA 


ATCCACCGAC 


CAGCAGATTT 


TTTTAAAAAA 


TTAATACCCC 


GTTAATACCC 


480 


TTTGCTTCAA 


TTTGATGAAA 


ATCAATGAAA 


TTCAAAATTG 


AAGAAATCCT 


TTAATATCAA 


540 




GGTGTACGAC 


AGTCTATATA 


ATCATGCGAA 


ATTCTAAAAT 


TTTCtGAcGT 


AAAAAAACAA 


600 


SO 


ATTCyTAAAG 


CAGCTCGTCG 


TTCACCTCAA 


TTCTCAAAAC 


GTTAATTGTC 


GGACGATATA 


660 




TATACAAAAC 


ACCTCGATGT 


TATGTCGAGG 


TGTrrrrri'G 


CGTTTGTGTG 


GGGAATATGG 


720 


55 


AATAGAGTAT 


AAATGAATTA 


ATGTCTCAGG 


TATAGAATTA 


ATTCAACTAT 


GAATTATTGA 


780 



1334 
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(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 432: 





TTGTACGAAT 


ACAAAGTTGC 


ATTAATATAG 


AATCATTTAT 


GCriTTTAAT 


ATCACAAAAC 


€0 


5 


AAATTGATTA 


AACTAGCTAA 


AATAGTCAAA 


ATTGGAATCA 


AGATTACATA 


AATATCATAA 


120 




CCTCTAGATA 


TTAGTATACA 


TAATATAATT 


AGCGACCCTA 


TTATAATTGC 


TAAATTATAA 


180 


10 


GCCAAACTAA 


TTTTATATTT 


CATTTCAATT 


CCCACCTTAA 


TAGCATTTAT 


TAAAATTCTA 


240 


TAATGTTTAC 


ATAGACTTCT 


TAACAAACTT 


TAACCCAGCT 


AAGCTAAGTA 


TTGAAATAAT 


300 




TACACCTCCT 


AATATAAATA 


TATTACTTAA 


GCTTAGTAAA 


GGTATAATTA 


TACTTATCAG 


360 


15 


TCCTAAAGAC 


AATGTATCCG 


CTGcATAATT 


CGaTGtAGAT 


GAGATACTAA 


ATACTTTCCC 


420 




CATCAAATGA 


TTTGGOGTTT 


TTATTTGAAT 


GGCAACTGAT 


CTTGTTAGTC 


CCTCTATAGA 


480 




TTGTCCAAGT 


CCCAATAATG 


TTGCACCTAT 


ATATAATATC 


GCCACACTTG 


GAAACACATT 


540 


20 


AATAATCGAT 


AAGCCAATTC 


CCCAAACTAA 


AACACCAATA 


CTAAATTTAA 


AGATTAATCG 


600 




CTTTTCTGAC 


AGCAAACCCA 


TAATCAATGA 


CATTAATAAA 


GATGCTATAC 


CTAAACATGA 


660 




TGTAGCTAGT 


CCATATACAC 


CAACGCCCTC 


TTTTAATATA 


TTGGAAATAA 


ACAATGGTAA 


720 


25 


TACAACACGC 


CAAAGGCCAG 


TATTAATCAA 


TATGCAGGCA 


AATTGGATGA 


TTATAATAAA 


780 




TGGAATTTCT 


TTAGATTGTT 


TCAAGAATTC 


CCAAGTTTCA 


GAAAAATCTT 


CTTTTGAGTG 


840 


30 


TCTATCAATC 


ATGTTGTTAT 


TTGTATATTT 


TAAAAGTGCA 


TTAAAAATAA 


ATCCTAAAAA 


900 


TAGCAATATA 


CTACAAATAA 


AAAAGACGCC 


AACATTACCA 


ACTAGTATTA 


CAATGACACC 


960 




AATTAAAGCA 


GGTAAAATAA 


TATTTGAGCC 


TCTTTGCAAA 


CTATCGATTA 


ACGCATTACT 


1020 


35 


TGTTGCTAAA 


TGCTCCTCAT 


CAATAATTTC 


AGGAAGAATT 


GCCCTAAACG 


CAGGATCCGT 


1080 




ATAGCAGTTA 


ATAATGGTGA 


TAGCTGTAGA 


TATGGTTAGA 


AGCGTCAGAT 


AATTTAAATT 


1140 




TGATGTTATT 


GCAAGTAAAG 


GAATAATTAT 


TATAATCAAA 


CTTAGTATAA 


GATCAGATAG 


1200 


40 


ATAAAGTATT 


TTCTTTTTAT 


TATGTTTATC 


AGAATATGCG 


CCACCGAAAA 


TACCAAATAT 


1260 




AATAAATGGA 


AGTGTTTGAC 


TCATAACCAT 


CATTGATAAT 


TTTAAAGATG 


ATTGGTTTGT 


1320 


45 


CAATTCAACA 


GTAAACCAAA 


TrATTTGTAA 


CGAAAACAGC 


ACAAAACAAC 


TCCGACGTAA 


1380 


GArATTACCA 


ATCAATAAAT 


ATGTAAAGTT 


TCTATTTTTC 


AAAACTTCTA 


AATACAACAT 


1440 




ATTTATCACC 


TCTCATAAAA 


ATAATTGAAT 


GCATCCACCA 


GCTTTTTTAG 


ACCTTCTTCT 


1500 


SO 


AAACTCTCTT 


TATCCAAAGC 


GCAATTAATT 


CTAATATAAT 


TTAGTCAGTT 


AAATATCAAT 


1560 




TATTTCGAAA 


TATACATACT 


ACTTGAAACA 


CCATACATAA 


CCCCCAAAAT 


GACTACTCAG 


1620 




AGGTTATATT 


CTACTAATTA 


TGATTATATT 


AAATATGAAA 


ATATTATCAA AAA7U\TCA7U^ 


1680 



55 
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5 



10 



IS 



20 



25 



30 



40 



TTTCCAAGAG 


AAGTTGCGGA 


AGTAATTAAT 




ATA ATA a&»PT* 


GGTCATTTCG 


60 


ATGATTTCGk 


CACAAATCGA 


TGCGGATAGA 


ATYirS H T'T* H T'T^ 
± A x X X X 


T A O A 71 O^TV* A 


TGCGTATTTC 


120 


ACAGGTGTAT 


CATATGGTGC 


TTTTGATATG 


GAGCGTATTT 


TA An ATTli HT 


Ot^GACC X A CT 


180 


AAAGATGAAG 


. TACTAATCAA 


AGAAAGTGGT 


ATGCATGCVAfS 


Tm A A A & CT*^ 


1 Ax X Aa G AGT 


240 


CGTTATCAAA 


TGTATTGGCA 


AATTTACTTC 


CACCCAGTTA 


OTP^^TViTSTVir^ 
w X V^V* X\>Vj 1 V>u 


AkjAACj TG CTG 


300 


CTTAATAATT 


GtTTGAAACG 


CGCAAAACAG 


CTTTATAATt^ 




All IAACjU^G 


360 


CATCCACATG 


ATTTTATTCC 


Al"l"ri"l"rGAA 


GAGACAGTTA 


PfS A TTrt A A O Ji 


Ax Altai iLaAA 


420 


CTCGATGAAG 


CGGTAGTTAC 


GTATTATTTG 


GAAAAATGTSA 




TGATGCTATT 


480 


TTAAGTGATT 


TAGCAAGTCG 


ATTTATTAAT 


CGAG A A T 


TTA & 2VT A'PR'P 
X X/vvri.XAX/\x 


X CCAl 1 iGAT 


540 


GGCTCAATTA 


TTACAATATC 


AGAACTGCAA 


GAACTGTTm 


A A P A 1'^/^ TY^ 


TATTAATCCa 


600 


GATTATTATT 


TTGTGAGTGA 


AGCATTTTCT 




A Tr^ H /""T TY^ TS 


TCGACCGGGG 


660 


TCAAATCGCA 


AACCGATTCA 


TTTATTAAGA 


^vTVWJAi X X r\ 


^-VjAI Xi\QjAQjA 


AAT AAG CAAT 


720 


CAATCATTAG 


TCATTCATAG 


TATTACAGGC 


ATT A ATr^f^ rT' 


AAtaAt- X AXAA 


ATTATATTAT 


780 


CCTAGAGAAA 


TGGTTGCAAA 


GATTAAAGAT 


AArSAf^AATTA 


vj/V\j AAva t, 1 Ax 


TGAAAATTTG 


840 


ATTAATGAGC 


TTAATTAAAC 


AGGGCTAAAA 


TmTT A TOnT 
X X V,9 X X X X 


X AAA 1 A 1\joA 


GGTTATATCA 


900 


TTGTCTGAGA 


AAAAAGGCTT 


TAATTTTAAT 


ATPATAAAAA 


Ax oAL.\,U id 


AGATGGTCaT 


960 


AAAGGTACAA 


ATATTGGTTC 


AATTAGCTTA 


GACAATATTH 


^..>^V>v.,iA^ X X 1 i 


1 ATL.GATGTT 


1020 


GCTAACAAAG 


AAGCATTTAT 


TGATATTGGA 


GGCATGCATG 


\» X X K3\^ L.AA 


Av» i i AAAAA 


1080 


GGTGTGAAAT 


GGATTACTGA 


TAAAGCTGCT 


GTTGAAGG CG 


ATf^ A Ar^f^Aa 
AX OAAVjC X aa 


A^>aATATTGG 


114 6 


TTGTGTTGGG 


TAACAACAGA 


ACX3TAATGAA 


CAAGGAPPAT 


A TT A rVSl-'nYl/^ 
AX X ALAjf^ X\jV» 


1 1 lAACAGCG 


1200 


TGCTATTTAT 


TAGTGAATAA AGCAATTCGT 


CGTGGTTATA 


A A Arf^TATl^^^/^ 
AAAva X A X o 


IXiAACATG 1 1 


1260 


AATATGATGG 


ATAAATCAAT 


GAAACATCAT 


ATTATr*ATar^ 


AJTv-AAATTGG 


TGACGAGAAT 


1320 


AAAGCTATTT 


TAAAAGACTT 


TTTAATGAAC 


CATGATGAAG 


GTATGTGGAA 


GCATTCTTCT 


1380 


GATGCTTTAC 


ATCAAGCATT 


TAATTAAATA 


TTAGAAACTA 


AAATTTCCCA 


ATTAATCTAT 


1440 


AAAGATATGA 


TTCATTTCTC 


AATGAC 








1466 


(2) INFORMATION FOR SEQ ID NO: 4 32: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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25 
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AAACCATTGT TGTCAACITG ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA 1020 

ACTGCTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 1080 

GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTtGTAC TTCATTAaCA 114 0 

GGtATTGCAA 1150 

(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 97 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 

20 AAGPiATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60 

AATATTTAAC GATATTTAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 120 

TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGGCATTC GTCACATTAA 180 

ATGATGGCAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 24 0 

AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300 

AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360 

CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 420 

TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 4 80 

AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 54 0 

AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600 

ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660 

TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720 

ACAAAAAGCA GTATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nGCGGAAAAA 780 

TAATAACnTT TTGGATn 7 97 

45 

(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



55 
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CATTTCAAAA AATTTGAAGA 


* vjj^ X X X X fu\r\ /v\ X f\\. 


AAGGGGCTTG 


GGTGAAAAAT 


420 




GCCACAGACG AATTAAAAGA 


TATTACTAAT nATTTar* a a jv 


AAATTCAAGA 


TATTAAAGTA 


460 


5 


TAAAAAGGTA TTAAGAAACA 


ClAATAAGTAT &T*2i & Trr* & t*t^ 


TAATAATAAA 


TGAATTATAT 


540 




AGTTCATAAT TTCGACTATA 


*-vva X vjV7^ X#^X 1 X A 1 1 


TTTCGGGATC 


TATGTCAAAT 


600 


10 


CGGACTAATG AATTCAATAA 


TGGAAC^JT^AA dCTi^nr^A KTr^ 


TTTGTTTAAC 


TTCTTTTTTA 


660 




Tl"l"i-riX3GAA AATAAAGTTT 


TGAACATAAT AAAATTTGAT 


TATGTTTTAA 


CXSAATTTTGA 


720 




TGTTTCTTGA ACTATATATC 


ATCTAGTCGT CATTTACAAT 


TGGTAAATAT 


GACTTCAAAC 


780 


15 


TGkATGAAGG TGATGGCGAT 


TAAAAGGCTC ATCCGTAGGT 


TCTAAAGAAC 


TAGAnTTT 


638 




(2) INFORMATION FOR SEQ ID NO: 429: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



25 





. (xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


429: 








nGTGTTCAAA 


TAATAGAATG 


GCTAATACCC 


CAACAATAAA 


TAGGAATAAA 


TATGTTGATG 


60 


30 


TCGTAATTTT 


AGTAAATAAT 


GAAAATCTAA AATTAGTAAC 


CCTATTTTGA 


ATATAAGCTT 


120 




TAACTTCTAA 


TAATACTGGG 


AAGCCGATTG 


ATCCTAAAAT 


TATTAAAAAT 


ATAACTATCG 


180 




CTTGTACAAA 


ATAATCATGT 


GCATAAGGAA 


TTAATGACTT 


ACCTGTAATA 


TCTAATCCAC 


240 


35 


CATTGGTAGT 


GGCAGAAATA 


GAAACAAAAA 


CACCTTGCAT 


GATTGCATAT 


TTTAAATCTG 


3 00 




GATTATCTCG 


ATAAAAGTAA 


AATGCTAATA ACATGGCTCC 


TACGAGTTCG ATTACAAATA 


360 




TTGATTTTAC AATATCAATA 


ATCAATTTGA 


CGGTACCACT 


CATTGTGTTT 


TTGTTATTAT 


420 


40 


CTAACATAAT 


TAACTGACGT 


TCTCTAATTC 


CAATATGTTT 


ACCTAGTACC 


ACCCATAACA 


480 




TCGTACGAAT 


TGCCATGACG 


CCAATTCCAC 


CAATATTTAA 


TATCACGAGG 


ATAATT/UVTT 


540 


45 


GTCCAAATGT 


AGAATAGGTA 


TCGACAATAC 


TTATCGGAGA 


CAATCCAGTA ACACTAATTC 


600 




CGGATACGGC 


AACAAATAAT 


GTGTCAATTG 


GATTTACTTC 


TACACCTGGT 


TTATGaACAT 


660 




ACGGTAAGTT 


TAATAATAAA 


AATGCAATGA 


CAATTGCGAC 


GATATAGTAC 


AATACAATAC 


720 


SO 


CTTGTTGAGG 


GCTTGATCTT 


TTTAAAAACT 


GGCTAAAAAT 


TGACACGTTG 


TTCACCTCAA 


780 




CATTATTTTA 


GTTTAATATT 


AATTTCTTTC 


TCTTTACCAT 


CTCTATAAAT 


CTTCGCTGTA 


840 




ATTGATTTCA 


AGTCATCTTT 


ATGACTAAAT 


ATAATCTGCC 


TAAAGCGTAA 


ATCATCTTCT 


900 



55 
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CCCTGTTTCT TAGCCTTTTC GTATATGCGT TCATAAGAAG AATTCTTTTT GATTTCAGAT 120 

TGTAAAGCAC TCTTTTCACT TGATTGTTTA TCTATTTTAT AATCTAAATC TGCAATCTTT 180 

^ CCTCX5CGTAT CATACGCATC CATTTTTAAA GATAGCATAT AAATACTTAA CATAGCAATT 240 

ACAGTAATCA AAGTTATGTA TAAAACTTTT TCAAATTTAG TTAATTGTAC AACCACTTTT 300 

CTCGAAACAG TCTTCTTTTC GGGCTTAGTT TGTGGTTGTT GCTTCGGTAT ACTATTATAA 360 

10 

ACTTGTTCGT CATATGGTTG GTACACTTTT TCTACAGCCA TTATAAATTG CTCCTTATTT 420 

AAGTATTTCA GCTACACGTA ATTTCGCGCT TCGTGCTCTG TTATTGTCAT CTAAATCTTC 480 

TTCTGTAGCG GTAATCGGTT TACGATTAAC ACGCTTTAAC TTAGGTGTAT ATGCTTCTGG 540 

15 

TATAACTGGT AATCCTCTTG GTACCTCTGG ACCTTTTTCA TATTCTTGGA ACACCTGTTT 600 

ACATAAACGA TCTTCTAAAG AATGGAAAGT GATTACCGAA ATCCTGCCAT CTACTTTCAC 660 

TAATTCAATC GCTTGTTCTA TTGAATCTTC AAAAGCTGAC AATTCATCGT TTACTGCAAT 720 

20 

TCGTAGTGCT TGAAATACTC GTTTTGCAGG ATGTCCGCCT TTTCTTCTTG CTTTTGCAGG 780 

AATACCTTCT TTTATAATGT CAACTAATTC TAATGTTGTT GTTATTGGTT GTTGTTCGCG 840 

2s ATGTGCTTCG ATTCTTCGAG CTATCTGTTT TGAAAATTTC TCCTCGCCAT AGCGATAAAA 900 

AATCTTCACT AACGCTTCAT ATGACCAATT GTTAACAATT TCATATGCTG TTAGTTCTTG 960 

TGTTTGGTCC ATACGCATGT CTAATGTTGC GTCATGGTGA TAACTGAATC CTCGTTCTGG 1020 

30 AATGTCX3AGT TGTGGGCTTG AAACACCCAA GTCGTAATAA ATTCCATCTA CTTTTTCAAT 1080 

GTTTAAGTCT TTTAATATTT GAGTTAATTC ACGGAAGTTG CTA 1123 
(2) INFORMATION FOR SEQ ID KO : 428: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 



45 



50 



TATATCCTCC 


AnACCATAAA 


AATGGAATTA 


TAGTGAAGAC 


CGCTATGATC 


CAGGATGACC 


60 


ACGTATTATA 


GAATCTTTCg 


ATAGAGAAAC GAATCaACGT ATCCgACATC 


aCGAAATGAA 


120 


ATTAGAAGAT 


TATAAAGATG 


AGTTAAGAAG 


AGAATATCTA 


AAACAATCTG 


ACAGAATTGA 


180 


AGGAGATGAA 


TAAGCGTGGG 


ACTTGATTTT 


AGTGGTTTAC 


CAGATTTAGC 


AGTATTGGAA 


240 


CAAATGAAGG 




GATTAGTGAG 


GTTATTGCGC 


CTGAACATGT 


TCGTATGCAT 


300 
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TATCCCTACA ACACGCATTG AGGCAGAAAT AACCATCGCT ACAATAACAA TAAATAAAAA 
TTGAATCCAT TTAGGAATGC CAATGACTTT ACTATACTCC TCATCAAATG ACAATATAAA 
5 TAATTCTTTA TAAAACAATG TAATAAACAG AACAACTATG ATGGCAATGA CAATAATCGT 

TGTTAAATCA CTTATATTCA CTGCGCTTAT TGAGCCAAAT AGCAATCCAA CAATTTCTTG 
ATTGAACCCA TCAGCTAATG AAATGAAGAT TGCACTCAAG GCGATACCAG CACTCATTAT 

10 

AATTGGAATA GCAATTTCTT GGTAAGCAGT GTATGACGTT CTTAATrrTT CAATTAGAAG 
CGCACCTACT ATTGCGAATA AGATTCCAAA CCACATTGGA TTAATAAATA CTAGTGTrGG 
CATAATAGTA AGTAAAAACA TACCGAAAGA TATACCACCT AAAGTTACAT GACTTAGAGC 
ATCAGCTWTA AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG GCGCAATAAA 
ACCTAtCAAG ATAcCACTAA TTAAAGAGTA CCTCATAAAA TCAAAATTCA ATAATGCATC 
TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTITATTA ATTACAACAT 
TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT TGAAATTTCA 
ACTTCATCAA GTGATTTAAA CTCATCAGTT GTACCATGGA AATX3CAAATG CTTATTTAAA 
CATGCTACTT CAGTAGCAGT ATCTGCTACA ACACCGATAT CATGAGTAAC TAAGATAATG 
GTGATACCTT CTTOTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 
TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTGcAG GTTCTGAAAT CAATGCTCGA 

3^ GCAATCATTa CACGTTGTTG TTGACCACCT GATAATTCTG CTATATTTTT ATGAATTAAA 

TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACrTTTT CATTATCTTT GCTATTAAAT 
GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA CAACTTCTTT AAC7VCTTGCT 

35 GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT AATTGATGTT 

TCTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA GGTAATAACC 
CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA AwTGCtAAAA 

40 ATTCACCTTT ATTTATTTTG AATGnnnTAT ATT 

(2) INFORMATION FOR SEQ ID NO: 427: 



(i) SEQUENCE CHARACTERISTICS: 
4S (A) LENGTH: 1123 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS : double 

(D) TOPOLOGY: linear 



so 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 



55 
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10 



IS 



20 



25 



30 



35 



40 



45 



SO 



ATGTGAGCCT TGTGCGATTT GTAAATAACC TAAACGTAGT ACTAGTACTG CAAAAATAAA 
TACAATCACA CCAAATATAA AGTTAATTCT CTTGTTAATT GTATTTTGAA CGATTTCATC 
ATTTGATTTT T C TTTTAGTC TTTTTAACAA AACTACCTAC CTCTATTCAA AGTCTTTCAC 
TTTAAATCAT ATATGAATTT AGAAATTATT TCTATCTTTT TGACAAAAAA ATAACGGTCT 
CATTtAAGAG ACCGAACAAG TAATCATACT TTATTTTGTT GCATTATATA ATTCGTCAAC 
TTTTTCCCAG TTT^CTACAT TCCAAAATGC GCCAATGTAG TCAGGGCGTT TGTTTTGATA 
TTTTAGGTAA TAAGCGTGTT CCCATACGTC TAAACCTAAA ATAGGTGTTT TACCCTCAGT 
TAATGGATTA TCTTGGTTTG GTGTAGTCAC AATTTCTAAC TGGCCATTGT TTACGACTAA 
CCAAGCCCAA CCTGAACCAA AGCGTGCAGC TGCTTTGTCA GCAAATTCTT TTTTAAATTC 
TTCTAAAGAA CCCCATTGTT CTTTAATTTT TTCTACTACA GTACCTTTTT CTTCTGAGTT 
TGGTGAAAGT AACTCCCAGA ATAATGAATG GTTTAAATGT CCACCGCCAT TATTACGTAC 
AGCAGTTTGG ATGTTAGCTG GTACACTGTC TAAATTAGCA ACAATTTCTT CAATAGATTT 
AGATTCTAAA TCTGTACCTT CTACTGCAGC ATTTAATTTC GTAACATACG TGTTATGATG 
TCTGTCATGG TGAATTTCCA TAGTTTCTTT GTCAAAATGT GGTTCTAATG CATCAAATGC 
GTATGGTAAT TTTGGTAATT CAAAAGCCAT AAATAATCAT CCTCCTAAAA TGTCTGTAAG 
TAAATAATAA CAAGCAGTGA CTGGTTCAAC AAAGAATTTG CTTAAATTCT ACTACTTATT 
ATTTTCTCTA CTCATTTAAT ATAACTCAAA TCAAAAATAA TTAAACATTT TGTATATAAA 
AAGTTAACAG ATTTGCCATA AAATCATACG AACGGAGTAT G7UVATGAACC TTTATCTTCT 
ATAATTTAAA AAATGAGATT TATGCATACA TCGGACCAAA TGTGCATAAA TCTCATTTCT 
TTATATTAAT CTTGGcAAGA CTCACATACA CCATAAACTT CAAGTTTGTG TTTGTGAATA 
TTAACACCAG GTAGTGATAA TTTTATCTGA TCTATTGGAC AATAATCTAT TACCTTTGTA 
TCTCCACACT TTTCACAGAT AAAATGATGA TGATGATGGT TTGTACAAGC GATTCTAAAC 
TTCATTTCAC CATCAAGTTC TGTATTTTCA ATAATTCCTA AATCTTTAAA TAAGTGCAGG 
TTTCTATATA TTGTGTCGAA TGAAATTCCA GGATAATTTT CATCCATAAC TTGTTGTATA 
TACTTTGCGT TTATATACTT ATCTTCTTCG ACAAAAATAT CTAACATATC TTTACGTTTA 
TCTGTATATT TTAAACCGTT CTCTTTTAAA ATTTTAATAG CATCATTTGT ATTCATTGAT 
ATTAGCTCCC TTTTTAAACT TCATTCGCAT TTTCTGATAA GCCATTGTAA TCATAAGTAA 
AATAACAAGT AGAACTACAA TTACACCACC CGGAGAAATG TCCATATAGA AAGCTAGGAC 
TAAGCCTAAT ATTACTGATA ATTCACCTAA AAATACACTT AGTAATATCA ATTGCTTAAA 



300 
360 
420 
480 
540 
600 
660 
720 
7B0 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 
AGAACGAACC TTTAATACGT ACGTGTCGAC CACGAACTGT CATGACAAAT TCATAAAACG 
AATAGTTTTT CATTAGTTCA TTGTCACATC AATCACTTTT GTkTCACCIT TAATCACAGC 
ATTTTCATCA TAAATATTAA TTGAAGCTGC TTGATCAGTG TTAGTAATTA TAATTGGTXSA 
AATTACAGAT TTAGCGTTAT TATTAATATA TTCAAGGTTG AATCTTACTA ATGGATCTCC 
GACGTTAACT TCGTCACCAC TAGACACTAA CACTTCAAAT CCTTCACCGT CTAATTGAAC 
^c^AUATUAA CTAATAATTC TAATCCGTTA TCTGCTTTTA ACCCAATCGC 
ATGCTTAGTT GGAAAGACAT TGTCAACACG TCCTGCAATT GGAGACACAA CTTCTCCTTC 
AGTTGGATTA ATACCAAAAC CTTCGCCCAT CATTTTrrGT GCGAATACAG GATCTGGAAT 
ATCTTCAATT TTCACGAATT CTCCAGTTAA TGGTCCATAA ATTCCGATAT CTTTCTGAAC 
TTCOTTGCCT TTTCCGAATA . ATTTTTTAAA CATACTTTCC ACTCCTACTT ATCAAAATCT 
GATATTAAAT CGCCATAACC CAATTCTTCT AACnTTCAT ATGGAATAAA TTGAAriGCA 
GCGGAATTGA TACAGTATCT TAAGCCGCCA CTTrCTTTAG GTCCATCAIT AAAGACATGT 
CCTAAATGAC TATTTGATTC TTCTGAACGC ACTTCAGTTC TCAACATACC AAATGATTTG 
35 TCGACTAATT CTATAA 

(2) INFORMATION FOR SEQ ID NO: 426: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 
ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATTAGCTT 
TGATAACTTT TCAGCCGTAT CCAACATTTC CGATTGTX3TT GTTTTTCGAC CCCTAGTATA 
TGTAATAGCC ATTTTAGAAG CATTATCAAC TAAAACTTTC CCATTTCTCT CTAAAATACG 



60 
120 
180 
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TATGACTCGT GAAAATTTAG TGACATGAAT AAAAATGTTG AGGCGTCATT GAGTATAAAG 1560 

GAAAGTAGTT CTGCATTAAT CACGAaGTAG AGCATGACmA CX5AGGAATAA CTATAGGGaG 1620 

ATGGTTTTGG AATGACGAtG TCTTGTATCA ACATGGTACA TTAGGTACGT TAATGGCTGG 1680 

CTTACTAGAA GGCACAGCTA CAATTAATGA ATTGTTAGAA CATGGGAATT TAGGGATTGC 1740 

AACGTTAACA GGGTCTGATG GCX^AAtAATA TTTTTAGACG GAAAGGCATA TCATGCTAAC 1800 

GAGCATAAAG AATTTATAGA ATTAAAAGGC GATGAGAAAG TACCGTATGC ATCGATTACT 1860 

AATTTTAAAG CGAGTAAGAC ATTTCCATTG CAACAATTAT CACAAGATGA TGTATTTGCA 1920 

CAAATTAAAA ATGAAATGTT AAGTGAGAAT TTATTTTCGG CTGTTAAAAT TTATGGCACA 1980 

TTTAAACATA TGCATGTACG AATGATGCCT GCTCAGCAAC CGCCATATAC ACGTTTGATT 204 0 

GATTCAGCAC GCAGACAACC TGAGGAAAAA AGACAAGATA TTCGTGGTGC CATTGTTGGA 2100 

TTTTTTACAC CAGAATTATT TCATGGCGTA GGGTCTGCTG GTTTTCATAT ACATTTTGCG 2160 

GATGATGAAA GAGCTTATGG TGGACATGTT CTTGACTTTG AAGTGGATGA CGTTGTCGTT 2220 

GAGATACAAA ACTTTGAAAC ATTCCAACAA CATTTCCCGG TAAATAACGA GACGTTTGTT 2280 

AAAGCGAAAA TAGACTATAA AGATGTGGCA GAAGAAATTA GAGAAGCTGA ATAGTCTAAT 234 0 

ATAATTAAAA GACCTTAGCG ATATTAGGAA CAGGTGGTTC TAAATGCATC GTTAAGGTCT 24 00 

TTTTATATTA TATATGTGCT TACATATTTT GTTGATACGC GCGTAAAAAG CTATAAATAG 2460 

TATCAAGATA TGGGGTGTCA AGAATGTGTT CACGTGCACG TCGATAAATA AAGCCTTGAA 2520 

TCGCTTCGAC TTCAAGAGGT TGCTGATGCG CAATGTCGTA ATACATGCTC GTTCCCATTT 2580 

CGTCGGGATA ACCTTGATAT ATAGTCATAA TAGTATCGAC AGTTTGTTCT GAAAAGTTTA 264 0 

ACCCCTCTGC CTGTGCAACA CGACAACCAT CTAGTAATAG CTGTCTACAC AGTGTACGTA 2700 

TTTCAGGATT ATGCATGATT GCAACGGTCT GTCTACCGAG TGCTGTGATA GAATTAATGC 2760 

CCAAGTTAAC TAACAATTTA TACCAAATAG CTTGTTGAAT ATTTGCTTCT AATACAATGT 2820 

CGATTTGACT GTCTTGGACC AAATCTCTAA ATTGTCGAGT TAATGCATTA TCTTGTATAC 2880 

GTAATTGATA ATCTCTGAAG TGCGTAACAA CATCGCCTTT CTTTTGACCA CTTATATAGA 294 0 

CAACTGCTTG GCATACGTTT TTAAATGAAA TATGTTCGAG TTGACCATAA CCATTTTGGG 3 000 

CTAAAATGAT GAGCGTGTCC TCATGTGCAA GATGAGTTAA ATGAGGAATA ACTGCATCAA 3060 

GTTGATGTGT TTTGACTGCA ATAATAATAA CATCAAATGT GTTTGTGACA TCTTCATAAC 3120 

CTTTCACAAC AATATCTTGT GCAGGTGCAT GtGGTACAGT ATAATATGTT ATTGTTTTGG 3180 

CGTGTCTCCC GATAAGCGTT GTATGAGGCA ATGATTGTTG TAATTCATAG GCAATAGTTG 3240 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 
TGGACCTATA AAATGGATGC ATCCAAATAT GGATGnGGGG TAnGGCGGGa AAATAGGaGG 
TTATATAGTA AGTCATTTAA GTCGAATAGC CATCCTTTTT AAAATGAAAA AAATAGAAAG 
CTCAATAGTT TGTTAAAGCC TTTAAATAGC GTCGTTACGC AATTTTAGAA TGCTAAAAAT 
TGTCACAAAT TTCAAATTAC GTGCTAAAGC TTGAGATATC AATATTTATT GGCGATAGAG 
TGTAATTTGA CTCCGCTTAC ATCTCAGTTT TATGTTTGTA ATTTGGTAGC ATAATATTAT 
AATAAAATAA AATTGTTAAT CTTTAATTTC AGTATAGATA TTTTTACGTG TAGTCACGTG 
TAAAATAAAT TCAATTAGGT TAGGAGACAT AATTATGAAA ACATTTGGTA AAAAGGTTGT 
ATTAATCGGA GATGGATCTG TAGGATCAAG CTATGCCTTT GCAATGGTTA CGCAAGGTGT 
TGCTGATGAA TTTGTAATTA TTGACATTGC AAAAGACAAA GTAAAAGCAG ATGTTCAAGA 54 0 

TTTAAACCAT GGTACAGTCC ACAGTCCTTC ACCAGTTGAT GTGAAAGCAG GTGAATACGA 600 
aGaCTGTAAA GATGCAGATT TAGTTGTTAT TACAGCTGGT GCACCTCAAA AGCCAGGTGA 660 
AACACGTTTA CAATTAGTTG AAAAAAATAC TAAGATTATG AAGAGCATCG TTAAGAGTGT 72 0 

TATGGATAGT GGCTTTGATG GATATTTCTT AATCGCGGCA AACCCTGTAG ACATTTTAAC 78 0 

AAGATTTGTA AAAGAATATA CTGGATTACC AGCAGAGCGT GTTATCGGTT CAGGTACTGT 84 0 

35 ATTGGACAGT GCACGTTTAC AATATTTAAT TAGCCAAGAA CTTGGTGTTG CACCTTCAAG 900 

TGTTGACGCT AGTATTATTG GCGAGCATGG TGATACTGAA CTTGCAGTTT GGTCACAAGC 960 
AAATGTAGCA GGTATTTCAG TATATGACAC ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 1020 
AGCGGAAGAA ATTTATGTGA ATACACGTGA CGCTGCTTAT GAAATTATCC AAGCTAAAGG 1080 
GTCAACATAC TATGGTATTG CATTAGCATT GATGCGCATT TCAAAAGCCA TTTTAAATAA 114 0 

TGAAAATAAT GTCTTAAATG TTTCTATACA ATTAGATGGT CAATATGGTG GTCACAAAGG 1200 
CGTTTACCTA GGTGTACCAA CATTAGTTAA CCAACATGGC GCAGTTAAAA TTTATGAAAT 1260 
GCCATTAAGT GCCGAAGAAC AAGCGTTGTT CGATAAATCT GTTAAAACAT TAGAAGATAC 1320 
ATTTGATTCA ATTAAATATT TATTAGAAGA CTAAGCCTAT TTTAAGTATT AATTAGAAAT 
ATATTAATGG TAAGAGGATC TATTAGTGTT GCAGTAACAC GTGGCACTGA TAGGTCCATT 
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GGGTTACTTT 


GATGAAAATG 


ATTGAAGTTG 


ATAGAATTAA 


TGACGTATCT 


TGTGTATAAT 


360 


ACCTACGAAG 


TACTTTCATT 


GGAGGAAAAA 


TAGTGACTCT 


ATTTATTATT 


ATCGGGGTTC 


420 


TCGTGCCAAT 


GGTTTATACC 


ATGCAGTTAA 


ATATTAAAAA 


TGAACCTGTa 


ACAAAGCGCA 


480 


ATCTTTTAAT 


AACATTAGCT 


TTATCTACGT 


TAGGTATTTT 


AGTAACCGCG 


TTAGCAGGTG 


54 0 


TAATCXTTTAC 


GAAACAAGCT 


TTTCCTTTAT 


TAAGTGTAGC 


AATTGGCTCA 


A'J."l'i~i~i'ACTG 


600 


GAATCGTTTG 


GGGCCTTTTA 


CTAAGTGGTA 


GcTAOGCGCT 


GATACGATTT 


TTATCTAACG 


660 


CATTTGGGCG 


TAAGTAATAG 


TAATCTGTTC 


ATCAAGTAGT 


ATCCGTGCTT 


GAAAACAAAC 


720 


TAAAACTCCT 


AATGTGGAAC 


TAGTTAAAAA 


ATTCTAAACT 


AGTACATTAG 


GAGTTTTGTT 


780 


ATGCAGAATA 


AATTTAATTG 


TTAAATTGAA 


AGTGCGGTAA 


AAATCCACTA 


TTTATTTGAA 


840 


AAAAATCGAG 


CAACCAAATT 


AAACCTTGAT 


ATCCTAAGTA 


AATACATAAT 


AAAACAAGTC 


900 


CGACATAAAT 


TAAAAATCGC 


AAAATAGAAA 


GTCCAACTCT 


AAAAAGGAGG 


ATGACTAATA 


960 


GTGCTATTAA 


AATAATTGTt 


AGTATACTCA 


ATGCTACAAA 


CCTCCTArTA 


CGCTTTTAAA 


1020 


TCCATAA 












1027 



(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 
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CGGCACATTT 


TTAATTTATA 


CATATTTTAA 


AACTAAGTAA 


CAGTTTGAAG 


AAATCGTAGT 


60 


TCAATAATGT 


TAATTGTGAA 


AATGTATATA 


AACATAAAAA 


AATCATGTAT 


AATATATGTT 


120 


GTTAATTAAA 


CAGTTCGAAA 


GCGAGATGAC 


ATTATGGGAC 


GTAAATGGAA 


TAACATTAAA 


180 


GAAAAAAAGG 


CCCAAAAAGA 


TAAAAACACA 


AGTAGAATAT 


ATGCGAAATT 


TGGTAAGGAG 


240 


ATTTATGTTG 


CAGCAAAATC 


TGGTGAACCC 


AATCCAGAAT 


CTAACCAAGC 


TTTAAGGTTG 


300 


GTGCTTGAAC 


GCGCTAAGAC 


ATATTCAGTG 


CCGAATCATA 


TTATTGAAAA 


AGCAATAGAT 


360 


AAAGCTAAGG 


GTGCTGGAGA 


CGAAAACTTT 


GATCACCTAA 


GATATGAAGG 


ATTTGGCCCA 


420 


AGCGGATCAA 


TGCTAATTGT 


TGATGCGTTA 


ACAAATAATG 


TAAATCGTAC 


TGCCTCTGAT 


480 


GTGCGAGCTG 


CTTTTGGTAA 


AAACGGCGGT 


AATATGGGTG 


TATCTGGATC 


AGTTGCTTAT 


540 


ATGTTGTCAT 


GTGGCCACAT 


TTGGTATTG 








569 



55 



*SDOaD: <EP 07865 19A2J_> 



1323 



10 



15 



40 



45 



SO 



55 



EP0 786 519 A2 

GTCTTTTTATC TCAAGATGTG TTACTCAAAA AGTTATAGAA GAAGCTACTA AAOTTAAAAC 
AGAGATTGAT ACTGCAGAAG ATAACTGTAT CTCTCCATCT ACTGTAAGTC GTATTAGAAC 
TAAAGCGGCT AATTCATTAC GAATTAAACC CTTTAATTGT TTGCCAGAAC ACATCGCTAT 
GGATGAATTT AAAAGCXSTTA AAAATGTAAC TGGATCAATG AGTTTCATTT TTATAGATAA 
TGATACTCAT GATGTTATAG ATATTTTAGA AAATAGAACT ACAAGATTCT TGCGTGCCTA 
TTTCGAGCGA TTCGATTTAA AAAATCGACA ACAAGTTAAG ACGGTTACTA TTGACATGTA 
TGAACCCTAT GTCCGATTAT TTCGCGACCT ATTTCCTAAT GCAGCTATTA TTTTTGACAG 
ATTCCATATC GTTCAACATT TAAATAGAGA ACTTAATAAG TATCGTGTAC AAGTTATGAA 
TGAATACCGT AATAAAAAAG GACCTGATTA TACAATTTTT AAGAATAACT GGAAAGTCCT 
ATTGATGGAT ACTAGTAAAA CCATATTTAG TAAATAGAGA TGGAATAAAT CTTTTAAGGC 
TTATAAACGC TCATCTGACA TTGTAGAATT CATGCTTTCA AAAGACGATA TACTACGACA 
CTCCTACGAA CTTGTCCAAG GATTACGAAA AGACCTAAGG TTATGTAATT GGCCTAAATT 
TATTAATCGT TTAAATTCAG TTAGTAAAAA GTCTGTGAGT AAGGGTGTAT GGAAAGTGGT 
TAAATATTAT AGAAAACATC AAAGGATGTT AAGAAATACA ATTTATTACC CAGCATTTAA 



TAATGGTGCT ATAGAAGGAA TTAATAATAA GATAAAATTA ATCAAGTGAA TTTCTTTTGG 
TTACAGAAAT TTCAACAACT TTAAAGCACG TATAATGATG aTTTTCAGCT TGTACAAAGG 
3^ AGaAAAAAAG AAGACAACCA AGCCCAATAA TGGACTGGCC GCCTAATAwA nGGGskCTAA 

AAGTTgTATT TTTAAAAATA GTTCCTTTAA ATTATATACC CnCCACATTT GGGGGAGGAC 
CTAAAAAAGC 

55 (2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 
CATTTTAATT GtTaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC CtGTGTTTTT 
AAAAGGtACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC 
GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 
GCCCTCACAA GAAGATATCA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 



2520 
2580 
264.0 
2700 
2710 
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<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



5 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 





AATTCCCCCT 


TTTTCCCCAT 


CCATTTTnCC 


ATCCACTGGT 


CCTTACGGGA CCATATTATT 


60 


10 


TnAAATTGGA 


nAAAATTATt 


TTAAAGaTTA 


TwACtACTCT 


TtAATCATTT TrGTGAATTA 


120 


AAAAAAGTAG 


TGCAAAAAGC 


AAAATATACT 


TTATACACTA 


CAAATCATTT ATTTATAATA 


180 




AAGTTTCACC 


AAAAAATGTT 


CCAACTAATG 


AAACCGCTTG 


TTCAGCAGTA TGATTATTAC 


240 


IS 


TGTCAATCAA 


TGGATTTACT 


TCAACTAAAT 


CCATTGAGGA 


AATTAAATGT GATTGATGCA 


300 




GTAATTCCAA 


TGCAAAATGG 


CTTTCTCTAT 


AACTAAGACC 


ACCCAAAACT CTAGTACCAG 


360 




TGCCTGGCGT 


TTCAAGCGGA 


TCTAAAGCAT 


CAACATCTAA 


AGATAAATGA ACGCCATCAA 


420 


20 


CATTGCGTGA 


CTTCAAATAT 


TCTATTGTAT 


TTTCAATTAC 


TTCCTTTATC CCCAATTTAT 


480 




CAATATCTGA 


CATAGTAAAT 


GTTTTAATAT 


TATGATCTTT 


GATAAATTGT CTTTCACCTT 


540 




TATCTAAATC 


TCTCATACCA 


ATTAGTACGA 


TGnTTTCTGG 


CTTGATTACA TTACnATTTA 


600 


25 


ATTCTAAAAG 


TTCTTTGGGA 


CCTTCGCCTG 


TCAAAATCCT 


TAGAGGCATA CCATGAATAT 


660 




TTCCACTTGG 


TGACTCTTCA 


GGTATATTTA 


AATCACCATG 


TGCATCATAC CAAATAACAC 


720 




CTAAATTATT 


ATAATGTTTA 


CTTATTGCTG 


ATACTGAACC 


TACCGCAATA GAATGATCTC 


780 


30 


CACCAAGAAC 


TAGAGGAAAT 


CTGTTATTTT 


CAATTGATGC 


TGAAACCTCT TTATTTAATT 


840 




TTTGATTAAC 


ATCTATAATT 


TCATCATAAT 


TTCTTAATCC 


TTTTTGTTCA CTATGAAATT 


900 


35 


TTTCAATGTT 


CACAGCAGGt 


ACCTTAATAT 


CCCCCTTGTC 


ATATACATCA AGGTCTAATT 


960 


GCTTTAATCT 


TGAAATTAAT 


CCAGCATATC 


TAATTGCTGT 


TGGTCCTAAA TCAACACCTA 


1020 




ATTTTCTTTG 


TCCAAATGTT 


GATGGTGCAC 


CTATAATATC 


AATTGCTTTT GTCTTTGTCA 


1080 




TAAGCGTCCC 


CCTTTGCTCT 


ATGTAATTAA 


AGAATAATGT 


ATGCGCTTAC CATTATCAAG 


1140 


CAATAGCTAC 


ACATATAATC 


TGTTTATCTT 


ATTACTTCAT 


AAAAAAAGGT TCTTCATCTT 


1200 




TTATGGTGGG 


AAGGTAAAAC 


TTCCTGCTTT 


TTTTAATACA 


CAAAAAGCGC AATTGCCTCT 


1260 


45 


ATAATTTAAA 


GTGACCAAAC 


CCAAACTAAA 


GGAGACAAGT 


GCGCCTATGT GTAATGATAC 


1320 




CTTAGAATTA 


CTAAGAATAA 


AAGATGAAAA 


TATAAAATAT 


ATAAACCAAG AAATTGACGT 


1380 




CATTATCAAA 


GGAAAAAAAG 


CAACAGTGGT 


TAATGCTGTA 


CTAACGTATA AGCCTTCGGC 


1440 


SO 


CTGTTATTGT 


TGTGGAGTTA AAAATGAAGG 


ACAAATTCAI 


AAACATGGTA AGCGTGTTTC 


1500 




TCGTATTACT 


TTACTTAAAA CTCAAGGGTA TAACACATAC CTCAACTTAG CTAAACAACG 


1560 
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ATCAGCTACT GAAAATGGTA AGCTTACATT TTTACAAGTA TTTTTTCAAG TCATGTCTGC 
ArrrGGAACr TGTGGACTAT CGCTTGGTCT CACAAGTCAT attagtgata tttctaaggt 
CGTACTAATG ATACTCATGT TTATAGGACG TCTTCGCTTA ATATCATTTA TCATTATGAT 
AGCAGGACGT CGAGAACCAG ATAAATTCCA TTATCCAAAA GAACGTArrC AAATAGGATA 
ATATAATAGC AATCTAAGTr TAGTTAATGT AGA7TTTAAC TGGAACTTAG ArTGCTITrr 
TAGTTTGTAT TTTTAACTTA TITTATAAGA CGATTCGTIT CGAAAATGGT AAACTAGTAA 
CAATGAGAGG TGTAACATGA TGGAAAAAAA TGAAAACATT AATGTAGAGA rtTTAACTAC 
GTCAGATATG CATAGTCATT TCTTAAATGG TGATTA7X3GT TCAAATATTT ATAGAGC^TXKJ 
TACTTATGTT AACCAAGTAA GAGCACAAAA TCATCGCGTC ATTTTATTAG ATAGTGGCGG 
AAGTTTAGCT GGCTCGTTAG CGGCCTATTA TTATGCTATT GTTGCACCTT ATAAACGACA 
TCCAATGATA AAGTTAATGA ACAGAATGCA TTATGATGCT AGCGGTGTGA GTCCAAGTGA 
A-rrCAAGTTT GGTTTATCAT TTTTAACTCG TTCAArPGCT TTGGCACGTT TTCCATGGTT 
ATCAGCAAAT ATTGAATACA ATGTTACTAA GGAGCCTTAT TTTTCAACTC CATATTGTAT 
TAAACATTTT GGTGACTTAA AAATTGCTAT CGTAGGCGTC ACAGCAGA1X5 GTITAATGGA 
AAATGAGTAT TCTGAAATGG AGCAAGATGT ATCTATTGAA AAGACATTAG TGGCATCAAA 
ACGTTGGATT AGATATATCC ATGAAGTTGA AGAGCCAGAT TTTTTCATTG TAATTTATCA 
TGGTGGATTG AATAAAATTA GTAATAGTAC GAAAAATAAA AAGGCAAGTT CGAATGAAGC 
TGAAAAATTA ATGGAAGAAC TCGGTGTTAT AGATITAATG ATTACAGCTC ATCAGCATCA 



2S 



30 



2^ y^^j.^^xKM\\j ATUUCAAAGA 1440 

GCn^TACAT CT^CGATTA ATTTTAAAAA GAGAACAAGA ACTTA^TX^TG TTCAAAGCAT X500 

TGATTCTAAA GTGATTGACT TAAATGAGTA TGAAGAGGAT CAAGAATTAT TAGATTTAAC 1560 

ATTCTATGAT AGAAAAGCAG TGGCTTATTG GTCACAGGAA ATCATAAGTG ATAAAGGTTT 1620 

GATGTTATCA GTAAATGGGT TACAAGATTT AGTCTGTCAA ACACATCCAT nrCGCAATT 1680 

ATTACATGAT GCAATTCACC TTCCArTTGA TAATGATATA ACATCTGTCC ACGTGCCTAT 1740 

GaACGGAGAG AAGGGGTTGA GTGGACAGAT TCGAAATCAA GrTTTGTATC aTCCATACCC 1800 

ATaTCCAGAT aAGCCAtGGG tATGACAATT aGTGGTCAAA ATaTCAAGGT ATnTTGGGGT I860 
ATAGTTATTC ACCATTAGG 

1879 

SO (2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 base pairs 
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TTCTTGGATA CGTAATAAAT TACCTTTTGT CATAGGATTA AACATAGATC CAGGTAAAAC 120 

ATAAACATTT CTATTTTGCT CTAATGCAAA ATCTATCGTG ATGTGACTGC CACTTTGTTC 180 

CTTAGCCTCA GTAATTAAAA CCCCTTTTGA CAAACCGCTG ATAATTCTAT TGCGCTCAGG 24 0 

AAATCTATAT TTAGCGATTG GTGTATGTGG TGGATATTCA GATATAACTA AACCTTTTTC 300 

TTCTATTTTA TTTCTTAATG CTAATGTACT TTTGGGATAA TGTGTTTGAT GGCCAAAGGC 360 

TAAAACTGCA ATTGTAGGGA GATTGTATTT TAAAGCTATT TGATGTGCCA TTGCATCAGC 420 

TCCTTGAGCA AGGCCGGAAA CAATTGTTAA ATATTTGCTT TTATCATTTG ATAATAAAAA 4 80 

TTCTAAAGAC TGTTGGGTAT AACTTGTAGA ATCTCTTGCA CCTACTACTG CCAAATGATG 54 0 

CATATTATTT ATTAATTTGA TGTTCCCTTT ATAGAAAAGA AGTAATGGAA ATTGATATAT 600 

TTCTTTTAAT AGCACTGGGT ATTCAGAATC CATATATGTA ATGTAACTTA CTTTTAATTT 66 0 
CTGCAGTTCT TTAATAATAT CGTTATGATC AAGTTTTATA AAAGCATAGT ATTTACGTAA 720 
TAAATGAACA TTTTCTTCCC TATTCACCCA TTCACATAAA TAACTATCTT TTTTTCTTCC 780 
CTCCTCTTTA ATTACATTAG GATATGCCAT TAAAAATTGA TGAATTTGTT TAGTCGAAAA 84 0 

GTGTGCCCAG TATAACTTAA GCAAAAATAG TTTAATCAAT AAATCAACTC CTTTTTGTAA 900 
TCATACAAAA TCATATTCTA TTTTTGTTTT ACATTTCTAA TACAAAAACA TTGTCGATGT 960 

AATGTTATTT TAAGGAGTAA AAATACTGAC TTAAAAAGTG AAAAGTATGT TGGAAAGAAT 1020 

30 TTAAAATTTC AATTTCCAAC ATACTTTATA ATTAAACCTT ATAAATAAGT TTTGCAAATT 1080 

TATATAGAAT TGGTCTTACT GGTTTGATGA AATCACCAAT TAATTCTTCA ACATGTGCAT 114 0 

TAAAACCCTT TTTAAATTG 1159 
(2) INFORMATION FOR SEQ ID NO: 420: 



20 



25 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 
40 iC) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



45 



SO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 

TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 60 

CCCCACTAAT ATTATCATGG GTATTTTAAT GTTTATAGGA TCTTCGCCAA GTTCGGTTGG 120 

TGGCGGTATT CGTACAACAA CTTTCGCTAT TTTAATTTTG TTTTTAATTA ACTTTAGTAA 160 

TAATGCCGAT AAAACATCCA TTAAAGTTTA CAATAGAGAA GTACACATTA TGGATATTCA 24 0 



55 



SCXX^O: <EP 07e6519A2.l_> 



1319 



10 



2S 



EP0 786 519 A2 

ATCGCCTGAA CATAATTATC AATTTGGTGG CGCTATGATA AAAAGTGAAG GAGTAGATAA 540 

ATTATTAAAA CCAGCACATG PJ^GPAPJkTC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 600 

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCAATAA TCTATGTCAT 660 

TGCTATCX3AA GGTGTTTCGC IKATTPiATATti AATCACTTCA TCATCACCAA TACTTCCCCA 720 

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAwT AaTAGTTTGG 78 0 

CCATTATAAG AACAATATAT ATCGAnTAAC AAT 8I3 
(2) INFORMATION FOR SEQ ID NO: 418: 

,5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

AAAAGCAATC GTTGGTGGTG CTATU^TTCAT CGGCAACTCA TATGTAAAAG CTGGTCAAAA 6 0 

TACACTTTAC AAAATGAGAT GGAATCCTGC ACATCCAGGA ACACACCAAT ATGCTACAGA 120 

TGTAGATTGG GCTAACATCA ATGCTAAAAT CATCAAAGGC TACTATGATA AAATTGGCGA 180 

AGTCGGCAAA TACTTCGACA TCCCACAATA TAAATAAGCA ACATGAACAT AGGATCAAAA 240 

GTCATCCCCC ACTATCAATC ATGGGGGATG ACCTTTGATC CCTTTTTTAT ACATACACAA 3 DO 

GCAAAAATAG CGGTGATTGT TTACCATCAA TTTTAACAAT CACCGCTACT TTTGCTTGTA 360 

ATTCATGATT CAATTTTTGT TGTGTGCACA ACGACACTAA ATTATGTGTT TGCTATTGTC 420 

GTGTTACAAC GATATGCGTC GTTGATTTAA CTTATCAAGT AATTGATTTA AATTGTCTAA 4 80 

TTCGACTTCC GATAAACATT GACATCTTGC TTCAATCAAT TCGCAACGTG CATTATTTAT 54 0 

TTGTGAAATT AATGTACGTG CTTGATCAGT CAAAATTAAT TCTTTACATC TTAAATCTTC 600 

GCTAQATTGT TGACTTGTGA TGTACCCnnT CAAAACTAAT 64 0 
(2) INFORMATION FOR SEQ ID NO: 419: 

^5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1159 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

so 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

55 
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5 



10 



16 



20 



25 



ATTCATTGTA 


TCTTCTTTTT 


CAGTCTTTTT 


GTTAATCACT 


TTTTTATTTT 


TATCAGCAAC 


1260 


AAGTACTTCT 


GACTCTTCAC 


CAGATTTTTG 


TTGCGTCACT 


TTATAAGCCC 


ATTCACCATT 


1320 


AGAATTTTCA 


AATGAAATTC 


CTTTCAACTT 


TTGGCCTTTG 


TAAGTTTCTT 


CAGCTTTTTT 


1380 


CACAGCATCT 


TCTGGGCTTG 


TTTTAACATC 


TTTTAAAGCA 


ATAACATCTT 


TTGTTGTATT 


1440 


AGTGTCTTGA 


TTAGTATTTG 


ACTCTGTTGA 


TTTTGTTTCA 


TCTTTTGGAG 


TATCATTGCC 


1500 


ACATGCAGTA 


AGCACCACCG 


CTGACATTGA 


TAACACTGCT 


AATGATTTTA 


ATTTCATAAT 


1560 


ATCACTCTCT 


CTTCCTATTT 


TTGAAACTCA 


TAACAAAAGC 


TTATATGCTA 


TATAGATTGT 


1620 


ATTACCCCTT 


GTTTTTAATT 


TTATTCATAA 


TTATTACAAA 


TATTTTTAAA 


TTAATCGTCA 


1680 


TGOGTTACTT 


TCGTTCGTAC 


TCCTTTTATA AATGAACCAT 


GTAATATAAG 


CATGCTATTA 


1740 


TCGACTCAGT 


TTGTCTAATG 


CTTTCTTTGG 


TACTTCTTCC 


TTTTCAACTT 


CTTCAAAAGT 


1800 


TTCTACATGA 


TGACCTTTAT 


GTGTAATTTT 


CAAGTATCTA 


TGCGGTTTAA 


CATCAAATGT 


1860 


AGCAGTATAC 


ATTAATTGCG 


TCTCTTTCCC 


TTCTTTATTA 


AAAACACTTT 


GACTATAACT 


1920 


TCXK3AGTTGA 


TCATCCATTC 


TAGTAGACAC 


CTCTGTCGTT 


TTAACATAAG 


AATCATCCTT 


1980 


CTTTACTAAT 


GGGTTAAATT 


GATCTGTTAT 


ACCATGGGAG 


TCTATTGTTT 


TTAAAATGAA 


2040 


TAAAGCAgCA 


TAAGCGCCAA 


TGATAGTAAG 


TACAAGATAt 


GTkATTGTTT 


TTAAAATCGT 


2100 


TTCAAAAGA 












2109 



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

GTTACATAAA TGAAAACAAA AAAGATAATT TTACTGCTTA TGCTACACCA GAACATAATT 60 

ATCAATTTGG TGGTGCTATG ATAGAAAGTG AAAAATTAAG CGAGTTACTA AAGCCAGCCA 120 

ATCAGTTAAA ATCACCAGAT GATATAAAAA AAGAACTAAA TAAAAAGAAG AGTCACTAAA 180 

GTTAGGAGTT ACTTTAATGT CCAAAAAACA TGTTTTTATA ATTATTGGTG TCATATTGTG 24 0 

TATATGTACA GTTTCTACGG TCATGCATTT TAAAATGAAA TATGATGAAA AAGAAAAACA 300 

AAAAGCGATT TACTACAAAG AACAACAAGA ACGTATTACA CTCTATCTTA AGCATAATAC 360 

TAAAGAAACG AACACGATTA AATCTGTACA TTTCACAAAC TTGGAAACAA GTCCTATGGG 420 
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ATAAAGCTAC ATTTACAATG CTACGTGATA TTCGGGTACA GTTTTTCGGT AAATTAGTAA 
ATGTCATTCC TAATGTTTAC CGTAAACTGA GTTCTAGTGA TTTAATTTCA CX5TATGATTA 
GTCGTGTTGA GGCATTACAA AATATAkATT TACGTGTTTA TTATCCACCA GTCGTCATCG 
GTTTGACAGC GCTAGTTACA GTCATAGTTT TGGCGTTCAT TTCAATCGGC CATGCG 
(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



3000 
3060 
3120 
3176 



60 
120 
180 
240 
300 
360 
420 
480 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 
20 TCTTTATTTC AATTTCCAAT TGAATTTTTT TATTATTTAC GCATAGCTCT TAAAATTAAC 

GTTACGATTG CAATTAAGAT AATTGAACCA ATTAATGCTG GCAAGATGTA AATACtTCCT 
AATTCAGGAC CCCATTGTCC GAATAGTTTG CCACCTACCC ATGATCCAAT AATACCTGCG 
ATAATATTGC CTAAAATACC ACCTGGGATA TCTTTACCCA TAATAGCACC AGCAGCCCAT 
CCAATTAAGC CACCGACAAT TAACATTCCA ATAAATCCAA ACATAATTTT CAGTCTCCTT 
TTTCTATTTA TTTTGCGTTA TTCTAAGTAG TACCCCTTAT TTACAATTCT AAAACAAATT 
CAAATTATTT TTATCCAAAT ATTTTTAAAA GTAGTAATTG AATATCAATT TTATTCAATG 
TAGCTATCGT TATTTAAAGT CTCTGTACCG ATAATATCAT ATACATTTAC ATTATTTTTT 
CTGCCGAATT CATAGCTTGA TTATTTTATG TTATAGGACT AgAATATACA CATATTATTA 54 0 

GAGCATCTTT GAATTTTAAA TCAAGAAGCG AGGTTAATGA ACAATGAATA TGCATATTTT 600 
ATATaACTTA CGAACTAAAC ATAATTTAGA AATTGACGAA TTAGCACAGC AATTAAATGA 660 
GAAATATGGT ACTAAATATG AAGCACATCA AATTTGGGAA TGGGAGAATC ATCACCAT^A 720 
ACCTAAATTT AAAGATGCCA TGCATTTAGC TGACTtCTTT GATGCACCAT ATGAAATGTT 780 
TTTAGAAAGT AAGGTTAAAG AATATCAGAA ACATTTAGAA GAAGTCGATA TTCGCATGGA 84 0 

45 TAAATAGATG CAAATAAACC CTCACAACAC GTTTGGCATA TATCCTTTCA AATCTATACT 900 

GGATATATTA CATTACGTTG TGAGGGTATT TTATTAATTA ATATGAATTA AGACATTTTA 96 0 

CAAGCGTTAA TGCAACGAAT CTTTTTAGTG ATCTTGCTCA CTCTTTAATA CTTTACCGTT 1020 
CTTAGCATCA ACAGTAACTT CTTGTTTTTT ATTACCTTTT TTCAAATCGA TATTX3TAAAC 



1080 



AAGTTTGCCA TCATCTTTTT CAAGTGACCA TTCTTTAATA TCACCATCAA ATTCTTTTTG 114 0 
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TTCTTGGTCA 


GTACTGGTCT 


CGGCATAATC 


GTTATAACGC 


AAAATATTTT 


AATAGCAGAT 


1200 




rrrxTAGCTA 


AAATTATAAG 


ACATCAATTT 


CAAGGTTTAT 


GGATTGTATT 


ATTTATTTTA 


1260 


5 


TTAGGTGTTT 


TACTTTTAAG 


AGCAACTGTG 


CAATTTCTAA 


ATCAATGGTT 


AGGTGATACA 


1320 




TTAGCATTTA 


AAGTTAAGCA 


TATGCTTAGA 


CAGCGGGTTA 


TTTATAAAAA 


TAATGGTCAT 


1380 




CCAATCGGTG 


AACAAATGAC 


TATACTCACA 


GAAAACATTG 


ATGGTCTAGC 


ACCrriTTAT 


1440 


10 


AAGAGTTATT 


TGCCTCT^GT 


GTTCAAATCA 


ATGATGGTTC 


CGCTCATCAT 


AATCATTGCA 


1500 




ATGTTTTTCA 


TCCATTTCAA 


TACCGCATTA ATTATGTTAA 


TAACTGCACC 


ATTTATTCCT 


1560 


IS 


TTGTTTTATA 


TTATTTTCGG 


TTTGAAAACG 


CGAGATGAGT 


CAAAAGATCA 


AATGACTTAT 


1620 


TTGAATCAAT 


TTAGTCAACG 


GTTTTTAAAT 


ATTGCTAAAG 


GTTTAGTGAC 


GTTAAAGCTA 


1680 




TTTAATCGTA 


CAGAGCAAAC 


AGAGAAGCaT 


ATTTaCGACG 


ATAGTACTCa 


GTTTAGAACT 


1740 


20 


TTAACAATGC 


GCATTTTaCG 


CAGTGCTTTT 


TTATCGGGAT 


TAATGCTCGA 


ATTTATAAGT 


1800 




ATGTTAGGTA 


TTGGATTGGT 


TGCATTGGAA 


GCAACGCTAA 


GCTTAGTAGT 


ATTTCATAAT 


1860 




ATTGATTTTA 


AAACTGCGGC 


AATTGCGATT 


ATTTTAGCGC 


CTGAATTTTA 


TAATGCAATT 


1920 


2S 


AAGGACTTAG 


GGCAAGCGTT 


CCATACTGGA 


AAACAAAGTG 


AAGGTGCCAG 


TGACGTTGTG 


1980 




TTTGAGTTTT 


TAGAACAACC 


GAACTATAAT 


AATGAATTTC 


TATTAAAGTA 


TGAGGAAAAC 


2040 




CAAAAGCCAT 


TTATTCAGTT 


AACAGACATA 


TCATTTCGAT 


ATGATGATTC 


TGATAGATTG 


2100 


30 


GTATTAAATG 


ATTTAAATTT 


GGAAATATTT 


AAAGGTGATC 


AAATTGCACT 


TGTAGGTCCA 


2160 




AGCGGGGCAG 


GTAAATCCAC 


TTTGACACAT 


CTTATTGCAG 


GTGTTTATCA 


GCCAACAATA 


2220 




GGTACTATAA 


GTACAAACCA 


GCGTGATTTA 


AATATAGGAA 


TACTTAGTCA 


ACAGCCATAT 


2280 


35 


ATTTTCAGTG 


CTTCTATAAA 


AGAGAATATT 


ACGATGTTTA 


AAGATATAGA 


AAATAATACT 


2340 




ATTGAAGAAG 


TGCTAGACX3A 


AGTAGGTTTA 


TTAGACAAAG 


TGCAATCTTT 


CACAAAAGGC 


2400 


40 


ATTAACACAA 


TAATAGGTGA 


AGGAGGCGAA 


ATGTTATCTG 


GTGGACAGAT 


GAGACGCATA 


2460 


GAACTTTGCC 


GTCTTTTAGT 


TATGAAGCCA 


GATCTCGTTA 


TATTTGATGA 


GCCTGCAACT 


2520 




GGTTTAGATA 


TTCAAACAGA 


ACACATGATT 


CAGAACGTTC 


TGTTTCAACA 


TTTTT^AAGAT 


2580 


45 


ACAACGATGA 


TTGTCATTGC 


ACATAGAGAT 


AATACAATTC 


GCCATTTACA 


ACGACX3CTTG 


2640 




TATATAGAAA 


ATGGAAGACT 


GATTGCTGAT 


GATCGCAATA 


TTTCAGTAAA 


TATAACAGAA 


2700 




AATGGTGATG 


ACTTATGAAA 


ACACGACTAA 


AATTTCAAGT 


AGATAAGGAT 


TTATTGTTAG 


2760 


SO 


CTATAGTTGT 


TGGTGTTTGT 


GGAAGTTTAG 


TTGCGCTCGC 


CATGTTTTTC 


TTAAGTGGTT 


2820 




ATATGGTGAC 


ACAAAGTGCA 


CTTGGTGCGC 


CACTATACGC 


TCTGATGATT 


TTAGTCGTTA 


2880 
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960 
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TTTAGATACA ATTACGAGTA TTTCAACACC AATGGGTGAA GGGGCAAITG GAATTGrrCG 

ATTGTCTGGA CCGCAAGCCG TTGAAATTGC TGACAAATTA TATAAAGGAA AACATCTTTT 

AAATGATGTT CCATCACATA CGATTAATTA CGGTCATATT ATTGATCCAG tCTAAAGAAG 

TGGTTGAAGA AGTTATGGTG TCTGTGTtAA GAGCGCCAAr AACATTTACA CGCGArGATA 

TTATAGAgAT TAATTGTCAT GGTGGtATTk TAACGATTAA TAGAGTGCTG GGA 1073 

(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3176 base pairs 
,5 <B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 

CTTTACCAAT GCCAAATCCG AAGTAAAGTA TAGCAATAAA GATTACTAAT ACAATTCTGT 6 0 

AAATGGCAAA TGGAATTAGT TTGATTTTGT TAATTAGATG CAAGAATGTT TTGATTGCAA 12O 

TTAGTCCAAC AGTAAATGCA GCTAAAAAGC CTAAAATATA AAAAGGTATA TCAGCAATCT 180 

GAATATCTTG ATAATGTTTT AATAAAGATA AACCACTAGC TGCTAACATA ATTGGAACAG 240 

CCATAATAAA TGTAAAGTCC GATGCTGCTT TATGATTTAA TTTCATTAAT ACCCCAGTTG 300 

AAATTGTTGA GCCTGAACGG CTGAAACCAG GCCACATAGC TACTGCTTGA gAAATACCAA 360 

TTACAAATGC TTGGAAATAA CTGATTTGAT CTACTGTTTG TGGGTTTTTA ACTTTAGCTG 420 

AGTATTTATC AGCAATAATC ATATAGATAG CACCTACGAA TAAGCCAATC ATAACAGTTC 480 

GCACACTAAA TAAATGTTCT TCGATGAAAT CATCAAATAG TAAGCCTAAA ATACCTGCTG 540 

GCACCATACC CACTAATACA TGTAATAAAT TTAAACGTCT TGGCTTTXSAA CGTCTTTGTT 600 

GATCGTTATC TCCTTCAACA TGTTTGTGTT TACCAATATG TAAAATCTCT AAGAAGCGTT 660 

CGCGGAACAC CCATGCTGCT GCAAAGACGG ATCCTAATTG GATGACGATT TTAAATGTAA 720 

ATGCTGACTG AGAACCTAAA AATTCAGATG ATTTTAACCA CATATCATCA ACTAGGATCA 780 

4S TATGTCCAGT AGAGGAAACA GGTGCAAATT CTGTTAATCC TTCGACGACC CCTAAGATAA 84 0 

TACCTTTTaT TAATTCAATG ATAAACATAA TGTACCCACT TTCATTACTC AATTTAATTT 900 

ATTTAAATAT CAAAATTACC ATATCATGAT AGCATATTCA TTTAAAGACA TGCTAGTTAT 96 0 

AGTTATAATA CTAGACTAAA GATGTATATA TTCATTTTCT TTTACATGTA AAACTACAAT 1020 

ATTTTATTGA GCTATTTAAT TTGATTTTAA GGAAAACCTT TTATAATAGG TTTAGGTCAT 1080 
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TGCTGCTGCT GCAATCTTCG TGCCTTGAAT AATCATTGCA CCATCATGTA AAGGTGTGTT 78 0 

AGGTATAAAG ACATTAATTA AAAGTTCTTG CGAAATATTT GAATCCATTG CAATACCTGT 84 0 

TTCAATATAA TCTTGAAGAC CTGTTTCTTT TTCAAAGACA ATTAATGCAC CTATACGTCT 900 

TTTAGCCATA TATTGCACAG CCTTTGAAAC CGATTGAATC AATTTCTCTT CATCTTTACT 960 

ATACGTATTA GAAGTATAGC GTTTTAAAAA GCTACCTCTA CCAAGTTGTT CTAACGCACG 1020 

TCTAATTTCT GGTTGGAATA TTACTATTAA AGCTAATACC CCCCATTGAA TAACX5ATATC 1080 

GAATAATTTA GATGTTGCAG TCAAGTTCAA TATCATACTT ATCTGCTGAC CAATAAC/UIT 114 0 

TACTAATATC CCTTTAAGTA ATTGTATCGC TTTAGTTCCC TTAAAGACCG TGATGAGAAG 1200 

ATAAAGTACA TACCAAACTA TCAGTAAATC AAGGATACTC GTTACAATTT TTAACGTACT 1260 

GAGGTTTTGA AAAAAGTTGG AAAAATCCAT AACATCTCCT CCGGGTAATA TTTTTCCATA 1320 
ATACCCATTA TACCAATCAT TT 
(2) INFORMATION FOR SEQ ID NO: 414: 



1342 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

CTGTATAAAG ATGGAGGTGT TTTGCATGGT AAAACGTACT TATCAACCAA ATAAACGTAA 6 0 

ACATAGTAAA GTTCATGGTT TCAGAAAACG CATGAGCACA AAAAATGGCC GTAAAGTTTT 12 0 

AGCGCGCCGT CGTCGTAAAG GCCGTAAAGT TTTATCTGCA TAAGATCACT GACTCATCAG 180 

TGAtCTkTTT TTTCGTTTAA ATTAAGAATA AATAGAAATT TATGTTATAA GCTCAATAGA 24 0 

AGTrTAAATA TAGCTTCAnA TAAAAACGAT AnATAAGCGA GTGATGTTAT TGGAAAAAGC 300 

TTACCGAATT AAAAAGAATG CAGATTTTCA GAGAATATAT AAAAAAGGTC ATTCTGTAGC 360 

CAACAGACAA TTTGTTGTAT ACACTTGTAA TAATAAAGAA ATAGACCATT TTCGCTTAGG 420 

TATTAGTGTT TCTAAAAAAC TAGGTAATGC AGTGTTAAGA AACAAGATTA AnAGAGCAAT 4 80 

ACGTGAAAAT TTCAAAGTAC ATAAGTCGCA TATATTGGCC AAAGATATTA TTGTAATAGC 54 0 

AAGACAGCCA GCTAAAGATA TGACGACTTT ACAAATACAG AATAGTCTTG AGCACGTACT 600 

SO TAAAATTGCC AAAGTTTTTA ATAAAnAGAT TAAGTAAGGn TAGGGTAGGG GAAGGAAAAC 660 

ATTAACCACT CAACACATCC CGAAGTCTTA CCTCAGACAA ACGTAAGACT GACCTTAGGG 720 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 



5 


TAACGTCGAA CTTGAGCTGT 


TACGTTATGA CTCATAATTA 


TTTTAGCATA 


GTCATTTAAA 


60 




TAAACTTCTG TTCTGTCTGT 


TGGATAAGCA AATTCAAGCA 


ATTGACTGTA 


ACTATCATTA 


120 




ATGACTTCTT GATCAACATG 


ACTATCAAAA TATACAGCAT 


AATAATAAGT 


ACCATCAACC 


180 


10 


ATATATAACA AATCTTCAAA 


CTCTGTAGTT ATTGGATTGC 


TATGATATGC 


ATAATTAATA 


240 




ACATCTTCTA AATCGTTAAA 


TTTCACAATG ATTGTTCTTG 


TATTTTTACG 


TGCTGAAGAC 


300 




TTTTGACGTT TAGAACCTTG 


AGCTTCTTTT TCTTTTGTTC 


GTTGCTCGAA 


TAATTCTTCT 


360 


15 


AATTGATCTT CACCTTCTAA 


TGTTTGAGCT AACAATTCTT 


GAACTTGTTC 


ATCAAATkGA 


420 




TCAGTTGCAT CATCATCAGA 


CATATTCATC ATATCTTCAT 


TTTTAGATTT 


AGAAATTGTG 


480 


20 


ACTTCGACAC CTTTTTCAAA 


GGCATGTACT TGAATCCATA 


ATGGACCTnC 


TACAACAAAA 


540 


TCTTCTACTT CGTTAATTTC 


ATCCATCATT GAnCAAAAG 






579 




(2) INFORMATION FOR SEQ ID NO: 413: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 





GTGGTGAAAC 


TTCTTGCTTT 


GTAATTTTAT 


AAAGTGGATC 


AATATCACTC 


TGACTTACAT 


60 


35 


CAGGTTGAAC 


TGTCATCTTT 


TTAGTTACTT 


TGTTTTCAAG 


CGTAATATTT 


GCTAATTcTA 


120 




GGtTTTACAG 


AATAATGAAT 


GTCATCAGCT AmCCCTTTmm 


CtTGATACTy CTCTTGATAT 


180 




TTTCCAGCTT 


TAGCATTTGA 


TAAATCAATC 


ACTACTCTTA 


AATCTTCTGG 


ATTTTCAATT 


240 


40 


TTTATTATCT 


TTGATTGTGG 


TCCTGAAATA 


GTCACATTAA 


CTGTTTCAGG 


CGCTTTTGTT 


300 




AAATGCAAAT 


CTTTAGTGTT 


ATAAAGAATT 


TCAACGGGTA 


CATCTTGAAT 


CGTTTTACTA 


360 


45 


GACTTTTGAC 


CAAGATTACC 


AGTGTTAAAG 


ATATTTCCAA 


AAACATTGTT 


AACAGATAAA 


420 


AAGAAAAACA 


ATGCCAAAAG 


AAAGGCAATA 


AATCTCAAGC 


CCCATTTACT 


TTCTAGCATA 


480 




TTATTTCACA 


CCTTTCTTTT 


GAAAGCGTGT 


GCCAAACCAA 


TGTTCAGCAA 


GCAACTCTTC 


540 


SO 


AAAAATTTCG 


TTTGAAATGT 


CTCGTCGTAA 


TTTTCCATCA 


AATGTTACCG 


AAATATCACC 


600 




AGTTTCTTCA 


GATACAATAA 


CGGTAAATGC 


ATCAGATACT 


TCTGAAATAC 


CAACCGCAGC 


660 
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AACTTTTCTC CAACTATGTC AGACTATA 2488 
(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 

TTTTACAATT TCAGATATCT CTAAAACACA TACCGATCCA ATGGCCAAAA TTATTCGTCA €0 

AAAATTGAAA AAACTAGGAA TTCGTAAAGG GATTCCAGTT GTATTTTCAG ATGAAAGTCC 120 

AATTGTCATA AGAGAAGATG TAAAAGATAT AGTTGGAGAT AAAAATGCTA TCAATCGAAA 180 

AGGACAAATG CCACCTTCTT CAAATGCCTT TGTGCCAAGT GTTGTTGGAT TAATTTGTGC 240 

AAGTTATGTG GTGAATGACG TATTAAAAGA TATTCCAGTT CGTCGCATTA AAGACAAAGG 300 

GCAATAATTC ATTTTGAAAG GGATAATTTT CAACGTAAGG CAAGTGTAAC CACACCATAA 360 

2^ AACTAAAACT GACTAGTTCG CAAGCATAGT AGAATATGCC TCGTGTACTA GTCAGTTTTG 420 

ATTTGATTAT AGCTAAAATA TCAAAGGTAA TCGATTTAAT CTTATTTTTA CGGCCTTTGT 480 

TTAAGTAAGT TGTTATATAT TTCTTTGAGT TGTTGTTCAC TTTTAGATGT CGTTTTTGGT 54 0 

TCGTAATAAA TTTTGTTTTT TAGTTTATCA GGTAAATATT GCTGTGAAAC ATAGCCATTA 600 

ACATATTGGT GTGGATATTT GTAACCAATA GATCGGCCTA GATCTTTAGC ACCTTGATAA 660 

TGTCCATCTT TTAAATGGTT TGGAATTTGG CCCACATGAC CGTTTCTAAT ATCGGACAAT 720 

GCACTATCAA TGGCACTCAT TGCTGAATTT GACTTAGGTG ATAAGCATAG TTCGATTACT 780 

GCTTGGCTTA GTGGAATTCT AGCTTCTGGT AAACCTAGAC GTTCTGCTGA TTCAATAGCA 840 

GCAAGTGTTC TCTGACCAGC ATTAGGTGAG GCTAAGCCTA TATCCTCATA GCTAATTACA 900 

AGTAATCGTC GAACTATTGT AGGTAAATCT CCAGCTTCAA TTAATCGTGC TAAATAATGT 960 

AAAGCGGCAT TGACGTCGCT ACCACGGATA GATTTTTGGA AAGCGCTCAT AACATCGTAA 1020 

45 TGCATGTCAC CATCCTTGTC ACTTACAAAT GCACCTTTTT GTAAACAGTC TTTAGCATCT 1080 

TGCAATGTAA CATGTCGATA ACCGT 1105 
<2) INFORMATION FOR SEQ ID NO: 412: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
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TTAAATATCT TGAAGATGTG TCAGATGAAG ATATCATTAA TTATCAAATT AAAACAGGTG 
CACCGCTTGT TTATGAATTA ACGGATGATT TAGAAGTTAT AGATAAATAC TACTTATAAA 
* AaAAGAGCTG CATGTACACA AGGAGTGAGT GTATATGcAG CTCTTAAAtA TCTGAAGTAA 

TGTAAGGAAA TAGTTAAGTA TAGAGTTTAT ATTAACGAGC TAGGGATACT CX;AAAATATA 
GTTAGACATA CAATATAGTC AAATTAAAAC AATTATTTCG CTCITTTATG TTGCTTAATA 

10 

ATCTTTAAAG CACGCTTTCT TGTTTTAATG TTAGGGCTAT TTAAATTACG ACGAGCAGTC 
TGTAAATCTA ATTTCATCTC TATCCCTCCT TGTAAATATA TTATCACCGA TAACTACTCA 
TATGTAAATA GTAATGATTA CGTTTTAAAG AAATTGTAAT AAAGTCGTGC TAATTTTTTG 
GAAAATGGGT ATAATTACCG GATATCTAAA AATGTGTGTC GTTTTTTAGA TCGTGAGGGG 
GAAGCTTTAA ATGTCX3AAGA AACAAAAATT AACGATGATT ATTACTATGC TGATGGGTGG 
20 ATTTTTTGGA TTATTAAATG AAACACTATT AGTGACGGCT TTACCAAGTA TTATGAAAGA 

TTTTGAAATT TCATATACAC AAGTTCAATG GCTGACAACA GCTTTTTTAT TGACTAATGG 
GATTGTTATT CCTTTGTCCX3 CGCTTGTTAT ACAACGTTAT ACAACAAGAC AAGTGTTITT 
25 AGTGGGTATT TCTATCTTTT TCTTAGGTAC ATTACTCGGC GGCTTCAGTC CGCACTTTCC 

AACATTATTA GTTGCTAGAA TTATTCAGGC GTTAGGCGCA GGTATTATCA TGCCATTGAT 
GATGACAACG ATTTTGGATG TTTTCCAACC ACATGAACGC GGTAAATATA TCGGGATATT 

30 

TGGTTTGGTA ATTGGTTTAG CACCAGCTAT TGGACCTACT CTTTCAGGTT ACCTTGTTGA 
ATATTTTAAC TGGAGATCGC TTTTCCATGT TGTCGCTCCA ATTGCAGCTG TGACATTTTT 
AATTGGaTTT AAAAcGATAA AAAATGTTGG AACTACAATT AAAgTACCTA TTGATTTTAT 
TTCTGTCATT TTTTCTGTAC TAGGTTTCGG CGGGTTATTG tATGGAACGA GTTCaATTrc 
AGAAAAAGGT TTTGATAATC CTAcGtATTA GTATCTATGA TTGGAGGCGT TGTITTAGTC 
^ GCATTATTTG TAwTACGTCA ATATCGGCTA TCAACACCAT TATTaAATTT TGCTGTATTT 

AAAAATAAAC AATTTACAGT TGGTATCATT ATTATGGGTG TCaCAATGGT ATCGATGATT 
GGTTCGGAAA CGATTTTACC TATCTTTGTG CAAAATTTAT TGCATCGTTC AGCTTTAGAT 
TCTGGATTAA CTTTATTACC AGGAGCAATT GTTATGGCAT TTATGTCGAT GACTTCGGGT 
GCTTTATATG AAAAGTTTGG TCCTAGAAAT CTTGCTTTAG TAGGTATGGC GATTGTTGTT 
ATTACTACGG CTTATTTTGT TGTAATGGAT GAACAAACAT CAACAATCAT GTTGGCAACA 

SO 

GTTTATGCGA TTCGAATGGT TGGTATCGCG TTAGGATTAA TTCCAGTAAT GACCCATACG 
ATGAATCAAT TAAAGCCAGA AATGAATGCA CATGGTTCAT CTATCACAAA CACAGTACAA 
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CPiPAPJATPJK GTGGTTTAAA CAAAACTGAT CAAGCAAACT TAAATAAAAT TGAAAATGTG 600 

TATAAAATCT TAAATAGTGA TTATTACAAA AAACAGGACT CTGACAAGTT AAGTAAAGCT 660 

GCAATTGATG GCATGGTCAA AGAATTAAAA GATCCTTATT CTGAATATTT AACAAAAGAA 720 

CAAACGAAAT CCTTTAATGA AGGTGTTTCA GGTGATTTTG TAGGTATTGG TGCAGAAATG 700 

CAAAAGAAAA ATGATCAAAT TATGGTTACT AGTCCTATGA AGGGATCTCC AGCAGAACGT 84 0 

GCTGGCATTC GTCCTAAAGA TGTCATTACT AAAGTAAATG GAAAATCAAT TAAAGGTAAA 900 

GCATTAGATG AAGTTGTCAA AGATGTTCGT GGTAAAGAAA ACACTGAAGT CACTTTAACT 960 

GTTCAACGAG GTAGTGAAGA AAAAGACGTT AAGATTAAAC GTGAAAAAAT TCATGTTAAA 1020 

AGTCTTGAGT ATAAGAAAAA AGGTAAAGTT GGAGTTATTA CTATTAATAA ATTCCaGAAT 1080 

GATACATCAG GTGAATTGAA AGATGCAGTT CTAAAAGCTC ACAAAGATGG TTTGAAAAAG 114 0 

ATTGTTTTAG ATTTAAGAAA TAATCCAGGT GGACTACTAG ATGAAGCTGT TAAAATGGCA 1200 

AATATTTTTA TCGATAAAGG AAAAACTGTT GTTAAACTAG AnAAAGGTAA AGATACTGAA 1260 

G ^261 
(2) INFORMATION FOR SEQ ID NO: 410: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



3S 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 
AAATATATTG AAnAGAnAAT TACTAAGATT AAATCnTCTT AAAATATCCC 
TCCTAAAGAT TAAAGGAAAG AGGTTATAAG TTATGCCAAA ATTAATTTTA 
GACAAAGCGA GTCGAATGCT AAAAACTTAT TTACTGGATG GGAAGATGTT 
AACAAGGTAT TAATGAAGCG ACTAGAGCAG GTGAAAAAGT AAGAGAAAAT 
TCGATGTAGC TTTTACATCG TTATTAACAC GTGCTTTAGA TACAACGCAT 
CTCAATCTAA ACAACAATGG ATTCCTGTAT ATAAAAGCTG GCGTTTAAAT 
ATGGTGGATT GCAAGGCTTA AATAAAGATG ATGCTAGAAA AGAATTTGGA 
TACATATTTG GCGTCGTTCT TATGATGTGA AACCACCTGC TGAAACCGAA 
AAGCTTACTT AGCTGATCGT CGATATAATC ATTTAGATAA ACGTATGATG 
AAAGTCTGAA AGATACTTTA GTTCGAGTGA TACCATTTTG GACAGATCAT 



TGAAATAACG 

TGTCGTCATG 

AATTTATCTG 

AACATTGCCA 

TATATTTTAA 

GAACGCCACT 

GAAGAACAAG 

GAACAACGTG 

CCTTATTCTG 

ATTTCACAAT 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
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AGTGCAAGTC TAGTTAATAA AAATCCAATA GATCAAAAAG TTATCGTGGA TAGTTACTTA 
GAGTGGTTAA ATCAAGGAAT TACTAAAGAA AGTATTACGA CATTTATTGA ACGATACGCT 
CAAGCATTAA TCCCGCCITT AATTGCTTTT ATTCAAAATT ATGGAATTGC TTTAGAAGCA 
CACATGCAAA ATACAGTAGT GAACTTGGGG CCACATTTTG ACaTTCAATT TTTAGTX3AGA 
GATTTAGCTG GTTCTAGAAT TGATTTAGAA ACATTACAAC ATCGTGTATC AGATATTAAA 
ATTACAAATG ATAGTTTAAT AGCTGATTCT ATAGATGCAG TGATTGCAAA ATTCCAACAT 
GCTGTTATTC AAAATCAAAT GGCAGAATTA ATCCATCATT TTAATCAGTA TGATTGTGTT 

,^ GAAGAAACCG AATTATTTAA CATAGTACAG CAAGTAGTAG CX5CATGCCAT TAACCCAACA 

CTACCACATG CAAATGAGTT AAAAGATATT TTGTTTGGAC CAACAATTAC TGTCAAAGCG 
TTGTTAAATA TGAGAATGGA AAATAAAGTA AAGCAATATT TAAATATTGA GTTAGATAAT 

20 CCGATAAAAA AAGAGGTGTA GTACTACATG GCACACRTTa AraTaaaTa^ *.n.r^r.r,^r.-.^ 

' iVi. U«-U-VVJn A J. 

AAaTATAACG CCAAAGTACT TCAAACAGTT TTTCAAAGTA AAAATATGCA ATTCACACCA 
GTAATTAAGT GCATAGCTGG TGACCGTACA ATTGTAGAAA GCTTAAAAGC GTTAGGTATC 
AATCATGTTG CAGAATCCAG ATTGGATAAC ATAATTAGTA TTGCAGATAC AGGATTTAAC 
A 

(2) INFORMATION FOR SEQ ID NO: 409: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

AGGTCGTCTr GCaGmAGmGm TATTTGACGA TCTTGCTTTC CCAAAACACG ATGATGATTT 6 0 

TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT 12 0 

TGATGATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 18 0 

ATAGTTAGAA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 24 0 

TGTGACTAAA AATTAATCAA TTATAGTGAA* ATATGGTGCG CTATCTTGCA TAAATTGATA 3 00 

TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 36 0 

GATAAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 42 0 
GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA 



480 
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CAGTTTTATT ATGCTTACCA TTAC A TTTTA AAGTAACTAA AATACCTGAA GaTGCATCAA 294 0 

GaTACATGCC GTTAAAAGTT ATTATAGATT ACTTCAAATT ACATATGGAA GGTCGACAAA 3000 

TATTTATAAC ATCATTATTG ATTATGGCGA CAGGTTTTTC ATATACGACA CTTTTACCAG 3060 

TTTTGACAAA CAAAGTATTT CCGGGGAAAT CTGAAATATT TGGTATCGCT ATGACGATGT 3120 

GTGCCATTGG TGGTATTATT GCAACGCTAG TTTTACCTAA AGTACTTAAA TATATTGGTA 3180 

TGGTAAATAT GTATTATTTA AGTTCATTTT TATTTGGCAT TGCTTTGTTA GGTGTGGTAT 324 0 

TTCACAATAT TGTCATCATG TTCATTTGTA TTACATTGAT TGGGTTATTT AGTCAATGGG 33 00 

CACGTACGAC AAATCGCGTT TATTTTCAAA ATAATGTTAA AGATTATGAA CGTGGTAAAG 3360 

TACTGAGTAT TaTTATGATG GgATAGAGGT ATGaTTCCAT kGGGAAGTCn ATTAATGAGT 3420 

ATATGTGCAG ATGTGTTTGG CATTGTTAGA ACTTTTTCAA TAATGGGAAT AAGTACTATA 34 80 

20 TGCATTACAA TGGTATTCTA TTTTATAAAT AGAAAGTTGA AGTTAAAGTT GGAGGAAAGT 3 540 

AATCATGGTA TATCTTGAAT GGGCAAAGGC AGATAGAAAT ATTCAATATC GTGTAATTAA 3 600 

CGCCATTATT AAAGAACGTA TTTACCCCGA GCAAACATTT ATTTCGCAAA AAGGATCTTT 3 660 

AATTOAAATT CAGTATCATA TGCATGTGTT GACTATTGAA GTTGTTAGAA AAAGTGCATT 3720 

AGAACGCTAT GAGTTTACAG GTGATATTAC TTATTTAAAT AAAGGTGAAA CGTCATTAAT 3 7 80 

TATAACTTTA GAAGGTTTAT TAGATGTGTT GAATCATGAC TTTGATATCC CTATTTCAGA 3 84 0 

GCGACTACGC GAAGAGTTAA TACACAGTCG AGATAGTTTA GTTGAAACAT ATAAGCAAAT 3900 

GTCTCACAGA CAAACGTTAA TAAGTCmAAG TTTTAAATTT TCAAGGTTAC CACAAGATAT 3 960 

TAACTTTTTT TCakGGTtAC AACATGTAAA AGATAGTGAT AAGACAGATG ATTTAACTTA 4 020 

TTCTGAGAGT TTGGTACCAG AGGGGCATCC AACACACCCT TTAACCAAAA CGAAATTGCC 4080 

CTTAACTATG GAAGAAGTAC GAGCATATGC ACCTGAGTTT GAAAAAGAAA TCCCTTTGCA 4140 

40 AATTATGATG ATTGAAAAAG ACCATGTTGT GTGCACAGCT ATGGATGGTA ATGATCAATT 4200 

TATTATTGAT GAAATAATTC CCGAATACTA CAATCAGATT CGTGTGTTTT TAAAGAGTTT 4260 
AGGTTTGAAA AGTGAAGACT ATAGAGCGAT TTTAGTACAT CCTTGGCAAT ATGATCATAC 4320 
GATAGGGAAA TATTTTGAAG CATGGnTTGC TAAAAAAATA TTAATTCCAA CGCCGTTTAC 4 3 80 

AATACTTcCA AAAGCaACTT aTCatTTaGG ACGATGTCTT TAATTGATAA ACcATACCAT 444 0 

GTTAAgTTGC CCgTCGATGC aCAAGCAACA AGTGCCGTTA GAACAGTCTC AACTGTGACT 4 500 

SO 

ACTGTAGATG GACCAAAGTT AAGTTATGCT TTACAAAACA TGTTGAATCa ATATCCaGGA 4 560 

TTTAAAGTTG CTATGGAACC GTTCGGTGAA TATGCAAATG TTGATAAAGA TAGGGCACGT 4620 

55 
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AAAAATGAAT ACCAGCAACT TCATCAATAA TTGTTGATCC ATAGCACTTA AATATGACAT ' i,, 

CTTTCTCCAA AATATCATTT AAAATACGTG TCATTAGTX3G ACCATTGTGC GTCGTTTCTT 12C 
CTGATAATGT ACGAATCTCA CCTGTAATAT GAACGTTTGT CGACAATTTG ATCTGTGGCG 

ACATAGCTGG GTATTTAGGA ACTAATGTTC IXSAAAGATAA ACCAGCATAA TAATCCaACXS 132 

TATGTTTTGC TTCAATGATT AATTCTTTAT CTACTTCTGC -m^ATAATCA GAATCTAATA X38 
CX5TCATCTAA TTGCCATGGA TGAACAATCA TAATGTGATA ATCATTAAAG TTAAACTTTG 

GCGTAAATTC ATTTTCTAAT TGTTTAATTA AGTCCGGAAA TAGITGATGA ACAGTTGTAT 150 
„ CATAATCTTT AGACAGTGAC ATAGTACGGC TTAATITACT GTCAATCAGT ACTATTTTCA 

ACTTAATAGG TTGATTGAAT TCTGAAGAAT ATAGGAATGT TTCTAATCCA TTTAAACCTT 162. 

TACGTAATTT AGCCCCAGGA TGTAGCGGAT GACCTTCAAT AACGGCTTGC TCTCAACGCA X68( 
20 AGTAACTATC ITCGCTATTT TCGATAATAT TAAATAAAGG TCCAGAATCA TGTTCCATTG 

ACAGTGCITG ATAGCTAATT GCAAATGTCA TATtAGTtGC ACrciTTATT AAATCTTGCT 180C 

GAAATTGATC ACTAGCAGCA TTTrTTAAAT CTGGTGCTTC AATTAAAATA CACTCAAGAA 185C 
TTTCATTTGG ATGGTGTACT CGTGTAATCG TATTTCTAAT GTCATCTTTA ATCTAGAAAG 

GGCCTTCAAC ATCAATTCGA TCAAAGGCGT GTTCTCCAGT GATAGGAGCA TATAATGTTT 1980 

GCtTAGCTTG TGGGAACTGG ATTTCTAGTA TATGAGTCGT TCAGATATCT AACATAATCA 2040 

AATCACGACT CAATATTTTC TTACTTTGAG TGCGCGCTTT AACTAAGTTT TCGCGATGCA 2100 

TTGATGTGAC CAATCTCTGA GTGACTTTAT CTCTTCCTTG TAAAATCATC TCTTTAAAAA 2160 

3, TATTAGCCCA ATCGCTATTA TGTTGTTGTA AAAATAAATA TClTrCTTGT TCTTCTTTCT 2220 

TAAATTTTAA TGTCTGTTCT TTAAAAATTA AGTTCAAGTT CATAATTCAC CTCTATCAAA 2280 

TATTTTACAA AAGCAAGATA GATTTGTATA ATCCATATTA ATGATAATCA yTCTTATTAT 2340 

- CAACAGAATO CGGGTGTAAG TTTTATCACA AAATATmr TTAGCAGTTC TTTTCTACTA 2400 

TTTCTAGGTA ATTGGATTGG ACAAATAGGG CTAAATTCGT rTCTACTTAC CACTTATCAT 2460 

AACGCAGTTT ATCTCGGGAT TGTCAATTTT TGCAGACTTC TACCAATATT ATTACTAAGT 2520 

CTGTGGGCAG GGGCAATTGC CGATAAATAT GATAAAGGGC GATIXSCTGAG AATTACAATT 2580 

TCATCATCAT TTTTAGTAAC TGCAATTTTA TGTGTGCTCA CGTATAGTTc ACTGCAATTC 2640 

^ CAATTAGCGT CATTATTATA TATGCGACAT TAAGAGGGAT TTTAAGTGCG GTTGAAACAC 2700 



CTTTAAGACA AGCAATCTTA CCAGATTTAT CAGATAAAAT ATCTACTACA CAAGCTGltnw 
CATTTCATTC ATTCATCATT AATATTTGTC GTTCAATAGG GCCTGCCATT GCTGGTCTCA 



2760 
2820 



55 



iNSOOaO: <EP 0786S19A2_I. 



1306 



10 



15 



20 



3S 



40 



SO 



EP 0 786 519 A2 

CGAAwwTTAT GAGCTAGAAA GGCTTATGCA GTTGACGTTT TACGTCCAAC TGCGTTCCTC 1440 

CGTCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 1500 

CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 1560 

CTAATTATAT AGAAGAAAAA GTGCATCAAT GACAAATTAA ATGAGATTTC TACTCTACCA 1620 

AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 16 80 

TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 1722 
(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 

GCGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 60 

^5 GAAAGCGACC ATTyCAATAC GAaTTgTGAt AAATAGAGAA CAGCAGTAAG ATATTTTCTA 120 

ATTGAAAATT ATCTTACTGC TGTTTTTTTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA ISO 

TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAGA TTGTATTTAA 24 0 

^ TAAGTTAATG TAATCCTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 300 

CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 360 

ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 420 

ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 480 

TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 54 0 

GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAACGGTAGA ATAAAAGGCT 600 

TTGTTAAACA CAGACGTTTT TGAATCAGTA AGAATTCTTG ATTTTTCATG GAAATGAGAT 660 
GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGTCTCTA 
ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 
GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG CAATGCTTTA 

CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 900 

TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 960 

AAGCTTGAAG GAATCAATGG TGTGACTTCT TGTGGAATCA TTTGGTATAT ATTTTTTCTA 1020 



720 
780 
840 
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GCACTATCCA TACTAATTTC AGATA 
(2) INFORMATION FOR SEQ ID NO: 4 07: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doub 1 e 

(D) TOPOLOGY: linear 



1705 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 
,5 TCAATATATC TTAAAATTCA ATGATTAAAT CnATTATCAC TAGACATnAA ATACATAAAT 

CCTATTCCCC ATTTTCATTT nTTAATTCAT AAATGAATCA ATAACCACCT AATAACAAAT 
CATATTATAC ACCTTTGTTC TCTATTTTTC TAAGGITTAA AAAATATTTT TAGGTAAACC 

TAAAAATAGA TGTAATAAAA TKCnrr^nn-rn r^rxT,^r.rrm->^m »r«-T— - 

ATACATTATA GATGGTCTGG TTCTGGGTGA ACGTATACTG AGGAAAtACC TTTTTTGTGC 
AAATGATGTT CGACATTGTC ACAAATTTGA TGCGCTTCTA CTAAGGAAAG GTTAGCATCT 

25 

ACAACAATTG TGACATCAAT AAACACACTA CTTCCATGGT AACGCCCTTT AATACTTTTA 
ACTTCTTGTA CTTCATCAAC TTCTAAAATA TCATTGCGAT ACGCTTCTAA TTCAGTTTCA 
TTGAAACCAT CACTCAACAT AAAAATTGCT TCTTTAAAAA TACCAAAACC AGTATAAACG 
ATTAGTAAGC CTAGTAATGT TGCTAAAATA ATATCGACT^ TTGGGAAACC GATTTGCGTA 
AAAATTAATC CTATCGCTGT TCCAATGCTG ACTAAACTAT CCGATAAATT ATCTTTGGCA 
3S GCCGAATTTA AAGAACTACT TTTCGTTCTT TTCGCTAGTC TTTGATTGAC TGCAAATACA 

ATCAACATTA CAAGACCACT GATTAAGCTG ACGATAATTG TTATTGCGTT AGGTACAACG 
TCATCTTCTT TGAACAAACG AGGTGCATTT TGAATAACTA CTTGGATACC TACAAACATA 
ATGACAAATG ACACCAATAA TGAAGAAATA TTTTCAGACT TCAAATGGCC ATAAGGATGA 
TTTCGATCGG CAGGTTTAAT TGAAATTTTC AATCCAATAA TAACAGCTAA AGAAACGATA 
ATATCTGTCA TATTGTTTAA TGCATCGGCT CTTACAGCTG CAGAGTTAAA GACAAAACCC 

45 

GTGACATACT TAACAATAGA TAAGATTATA TATACAATTA AACTCAAATA AGCACCGCGT 
TGCGCCAATT TAAGATTTTC ATTATGAGAC ATGCGTTGAA CCACCTTGAA TTAGTATAGT 
AACAATATTA TGAATGATTC ATriTAATTT TACAACGTTT TTAATTTTTA TAAATTTTTA 
TAAAATTAAA CTAATTTATT CATTCGCAAC CCCTAAAAAT AATmTAGC CTTTCTGCGA 
ATTTTATGAG CTAGAAAGGC GCCCAACTCT CCCTGTTTGT TAACTTTCGC CTCX3AAAGTT 



55 



1304 

BNStXXMO: <EP_07e6519A2_l_> 



EP0 786 519 A2 



IS 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

GCTGATGTTC GTTGCCTTTn TCCACCAGAC AATTCAGAGG GATATTTATC ACTAATATCC 60 

AATATATTTA ATGCTTCTGC TACTTTTTCA TAACGATTTA ACATATGTTC TTTATCTAAC 120 

TTCTGTACCG TTAGTGGTAA CATTATGTTT TCTTTAACAG TCAATGTATG CAGTAAATTA 180 

TACTCTTGAA AAATAAAACC AATATCATGC TTGCGTATAT CAGATwATTC CTTGTTTGAA 24 0 

AGCTTTTCTA ATTTTTTTCC TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 300 

CTTAAAACAT TTAATAATGT CGTnTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 3 60 

TCGCCTTCTT CAATAGACAT ATTGATATCT CGCAACACTT CTTGTGCCAT TTTTTTAGTT 420 

CCATATATTT TTGTTAATTG TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 4 80 

ATCTTAACTT CATTTCCTTT AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 540 

CATCTATCTT ATTATAACGT TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600 

ATGTTTAAAA TGACAAATTT GTCACTTCCG ACATGCGTTC AACAATTTCA TTTTGTAATG 660 

GGAAAATCAA TCTGACAGTT GTCCCCTTAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 720 

CTAATTGATC CTTTACACTA TTTACTAAAT ATAGACCCAT ACCTGAAGAC GTCGTTTCAT 780 . 

TTCTGTTAGC CGTTGACGTA AATCCTCGTT CAAATATTCG CGGCATATCT TTTTTACTAA 84 0 

30 TACCTCTGCC ATAGTCTTTA ATATATAACG AAACATGTTG ATCATTTAAT TCTGTCCCAA 900 

TTTCAATATT AAAATTCTCA CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 960 

TACGACACCA TTTTATATCT GTATAAACAT AATCATCCAC TTTAAAGTCA ACATCAAAAC 1020 

35 

CAATACCTTT AACCTGACTA ATATGTCTTG TTAATTGTAT TTCATCAATG ACCATGCGTT 1080 

TAAGTGACAC GTAATCAAAA TACATATCTT TACGTTGAGA TTCTAATCTA GTAATATACA 114 0 

GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200 

40 

CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260 

GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320 

45 TTTCTTTGTG CGCTGAGATT TGACGATATA AATAATCAAC TGTATGACGT TGAAATGGCG 13 80 

TTTCCGCTAA ATCTTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 1440 

ATAATTTTAC TTCTTTAAAA TATGTCAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 1500 

AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 1560 

ACATAAGGAA GTTTAAAAAC AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620 
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CAGTTACAC5G ATGTTCTACT TGAATACGTG TATTCATTTC CATAAAATAA AATrTATTAT 
CATTTAAATC ATATATAAAC TCAATTGTTC CCGCATTTTC ATAATTTACA GCTTTCGCTG 
CACGAACTGC GGCATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 
CTTCTTCCAC TAATTTCTGC ATACGTCTTT GAATTGTACA ATCACGrrCT CCTAAATGAA 
TTACATTACC ATAGCTGTCC CCAACAATTT GGATTTCAAT ATGGCGGAAG rTTTCGATGA 
ATTTCTCCAT ATAAAGTCCA CCATTACCAA ATGCAGTTTG AGCTTCTTGT TCTGTCATrC 
GGAAGCCAGT TTCAAGTTCT TTTTCATCAC GAGCAACACG GATACCTTTT CCGCCACCGC 
,5 CAGCAGTAGC TTTAATGATG ACCGGATAGC CAATTTTTTT GGCGATTTTC TTAGCTTCTG 

AGACGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATrT 
CTGCCTTAGC AACATCTTTG ATACCCATTT TTTGGATAGA TTGATAACTT GGTCCAATGA 
ACTTCAATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 
AACCCGGATG AACGCCATCA CAACCTGTAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 
TTAAATATGA ATCTTTAGAC AAAGTGGGAC CTACGCAATA TGCTTCATCA GCAATTTGAG 
TATGTAGCGC ATCTTTATCC CCTTCAGAAT AGATTGCAAC AGTTTGGATG CCTAAATCAC 
GACAAGCGCG AATAATCCTA ACTGCGATTT CACCGCGGTT TGCAATTAAA ACCTTTTTCA 
TTATTTCACC TTAAATAACG GTTGGCCATA CTCTACCATT TGTCCGTCTT CTACTAAGAT 
TTCAACAATT TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCCTCTAA 
AATACACACT GTTGTTTCAT TTGAAACAGT GTCCCCAACT TGCACATATG CTTCTTCGTC 
35 TGGAGATGGC GATTTGTAAA ATGTACCTAC CATAGGTGCA TTAATTGTTT TGTGATTATC 

TGAAGTTGGC TTTGGAGCTT CAGTTTTATT GCTATCAGTT GATTGTGCTT GAGGCATAGG 
CATTGCCGCA GCTTCAACTG GCATTTGTGA GATTTGTGGC GTGATAATCT CAGTTTCTTT 
TTCTTTCTTA AGCGTCACTT TGCCTTTAGT ATCTTCAATA ITGATTTCCG TTAAAGTTCA 
TTTATCCAGA ATTTCAATTA ATTCTTTGAT TTCTTTAAAG TTCATTATTA CTGACTCCTT 
CAGTTTGTTT TCATCTACCC GTCTATTTTA CTTCAGACAA CTCTTCAATT CAAGCATCTT 
CATATTGCTG GCGACATTAT AAGTCTATCC CAAAGTTATA ATAAAACCAC ATTTTAAATT 
AAAAACACTT GTGTATTTAT TACTTAACAT TCACTCATCT TAACTCTTGA 
so (2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1705 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
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TGTATTTGTT CAGCTAATGG AAAACGACTA TTTATCGTTG AATGGATATC TCX5AATTGCG 420 

GCACCTGTAC TCGCTGTTAA AACTGCAATT TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 4 80 

TTCGCTTTAT CAAAACAACC TTCTTCAGTT AATTTTTTCT TTAATGCTTC TAATTTTTGA 54 0 

TATAAGTTCC CTATACCATC TAATTGCATT TTATTTACAT AAATTTGATA GTTTCCACGA 600 

CGTTCAAAAA CAGAAACACG TGCTTCTAAT AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 66 0 

TTTAAITTAG AAGCACTACC TTTGAACATC ATGGCACTTA TAACGCTTTC TTTATCTTTC 720 

ACATTAAAGT ATAAATGaCC ACTTGaATGc TTTTTGAAAT TTGAAAGCTC ACCTTTAATC 7 80 

AATACAGATT GGAGATGTGG ATCTTGATCA AATTTATATT TAATATATTT CGTTAAAGCT 84 0 

GAAACACTTA AATAATCTGA CATATAACAT CACTCAATTT TATTTTTTTA TATTACTCAA 900 

TACACCATTT ATAAATTTAT AATGATCATC ATCACTGAAT TGTTTTGTTA ATTCAACTGC 960 

20 TTCATTCATT ACGACTTTAG CAGGTGTATC ACTGTGTAAT ATTTCATATG TTGCCATTCT 1020 

TAAAATAATA CGATCCGTTT TTAATAAACG TGCAATAGTC CAATCTTTTA AATAAGGACT 1080 

AATTGTCTCG TCTAATACAG GTTCGTGATC TTTAACGCCA GAAACTAGCC AATGAATAAA 114 0 

TTCGAAGTCT AAATCTGGAT TATCGTCTTT AATAAAGCTT ATCGCTTCAT TTATCGTTAA 1200 

ATCACTGTCC TTCATTTCTA ATTGAAATAA AGTTTGAAAA GCTTGCACTC GGGATTCTTT 12 60 

ACGACTCATT TTTAACTCCT TCAAACGTTT GTATTTTTCT TTATTTAATT ACTGAATTAG 13 2 0 

GTATCACATT ACTTTTCAAT AACGATTTGT GTAATGTGAA TATTAATTTG CTTAGGTTCT 13 80 

ATCGCTGTCA TATTAGAAAT TGAATTAAAA ATTGACGTTT GAATTTTGTT TGCAGTTTTT 1440 

GAAATATTAA CACCATGTTT TAATGCACAA TATACATCTA TATATATGCC ATCTTCTTTA 1500 

CTCTCGATTT TTAAATCACG GCTTAAATTT TTACGACTAA CTTTTTCTAA ATTTGTTTCT 1560 

TTTAATTCAG CAAAATGGCC AGTGATGCCT TCGACTTCCG AAGTAGCTAT ACTTGCAATA 1620 

ACAGATAGCA CTTCTGGCGC TATTTCTACT TTACCTAATT TTGAATTTGA ATAATCAGTT 1680 

ACTTTGACCA TGGATTGACC TCCTATTAAC CTTCATCATT CATAATGCTA TTTTGCTCTA 174 0 

AAAAGTTTGT ATTAAATTTA CCGCTTCTAA ATATATCGTT ATTCAATAAT TTAATATGGA 1800 

ATGGAATAGT TGTATCAATA CCAAGAACCA CAAATTCACT TAGTGCACGA ATGCCAGCCA I860 

TAATCGCTTC ATCTCGTGTC GGTTCATGTA TGATTAATTT CGCTACCATC GAATCATAAT 192 0 

ATGGCGGTAT CGTATAATTA GTATAACATG CTGACTCTAT TCGAACACCA TATCCACCTG 1980 

GTGCAAGATA TTGCTCmATT TTACCTGGTG ATGGCATAAA GTTCTTGTAA GGATTTTCAG 204 0 

CATTAATTCT AAATTCAATT GCGTGTCCTG TTAATTTAAT ATCTTCTTGT TTATACGGTA 2100 
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ACCATTTGGA TTTTCAGGAA AGC3ATTTAGA ATAAATCACA GAACATAAAT CAGGATTTGT 
TTCAATTATA C3GTAGACCAA ATGTAGGAAA GTCAACATTT GTTAATAGAG TGATCGGCCA 
TAAAATAGCA ATCATGTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT 
GACAAGAGAT GACGCGCAAA TTATATTCAT TGATACGCCA GGTATTCATA AACCTAAACA 
CAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 
CATGTTTATG GITAATGCCA ATGAGGAmAT TGGAOJAGGC GATGAATATA TTATAGAAAT 
GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 
AGATGAATTA ATXSCCAAAGA TTGAAGAATA TCAAAGTTAT ATGGACTTTA CAGAGATTGT 
ACCTATTTCA GCATTAGAAG GGCTAAATGT CGATCATTTT ATTCATGTTT TAAAGACGTA 
TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 
20 TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTCAAG AAATCCCTCA 
TGCGATTGGT GTTAATGTGG ACCGTATGGT TAAAGAAAGC GAAGATCGTG TTCATATCGA 
AGCAACTATA TATGTTGAAA GAsGTTCGCA AAAAGGAATT GTCATTGGAA AAGGCGGTAA 
AAAGTTAAAA GAAGTAGGaA AACGTGCGAG ACGTGAtATA GaAATGCtTC TAGGCTCTAA 
AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 
TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 
AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 
CACGATTTTA AATG 

(2) INFORMATION FOR SEQ ID NO: 405: 



25 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 . (D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 60 

TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 120 

CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TGCTGATATA 180 

ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATTCCATAAA 240 

TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACTATCTGCA 300 
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GCAACGTATA TATGTTAATG CCATGAAAAA TAATGATTTA GTATTTGGTA TAGGTCCTGC 34 80 

TGGTACAGGT AAGACATTCT TAGCTGTAGT TTATGCAGCA AAGCAACTCC GTAAAGGTGC 3540 

TGTTAAACGT ATTGTATTAA CAAGACCTGC TGTTGAAGCA GGAGAGTCAC TTGGATTTTT 3600 

ACCAGGAGAT TTGAAAGAAA AGGTAGATCC ATATTTAAGA CCTTTATATG ATGGTCTATA 3660 

TACTGTTCTT GGGCGTGAAC AAACAGAGCG ATTTATTGAA AGAgGCATTA TCGAAATAGC 3720 

GCCACTTGCA TATATGCGCG GACGAACATT AGAAGATGCA TTTGTAATTC TTGATGAGGC 3780 

GCAGAATACG ACACATGCGC AAATGAAAAT GTTTTTAACA AGACTAGGTT TTGGCTCAAA 3 84 0 

AATGGTAGTT ACTGGTGACC AAACTCAAAT CX5ATTTACCT AAAGGTGTTA AAAGTGGACT 3900 

TAAGGAAGCG GTCAGTAGGT TACACAACGT TAAAGGTATA AGTATATTGA AATTAGATCA 3 960 

GAGCGATGTA GTCAgACATC CATTGGTAAG TAAGATCATT GAACATTATG AAGGAGAGAA 4 020 

20 TTAAATGTTT ACGATAGATT TTAGCGATCA CACAGGCTTA GTTAAAGATG CTTGGTATAA 4080 

ACAAATTGAA GATTTATTAG AATTTGCTAA AAAAGAAGAG CATATAGAAG ACGATGCTGA 414 0 

GCTTTCTGTT ACATTTGTAG ATAAACAAGA AATACAAGAA ATTAATCGAA CATATAGAGA 4200 

TAArGwTAAr GTTmCaGATG CAaTCyCaTT CGCTTTAGrA GrAGATGAGC CmGaGATlcGA 4260 

TtTTAGTGGT CTTGATATAC CACGTGTTTT AGGGGATATA ATTATCTGtA CGGATGTAGC 4320 

GCAAGrACAA GCAAACAATT ACGGACATTC TTTTGAACGA GAATTAGGAT TTTTAGCATT 4 3 80 

ACATGGATTT TTGCATCTAT TAGGTTATGA TCATATGACT GAAGCGGATG AAAAGGAAAT 444 0 

GTTTGGTCGA CAAGATACAA TATTAAACGC ATATGGATTA ACACGAGACT AATTATGAAA 4 500 

AGGTTTAAAT ATGCACTTGA TGGGCTGAAA ATCTTAATTC AAAAAGACTA TAAATTTCTT 4 560 

TTACATGTGT TTGCAATGAT TGTTGCTATT GTCTTTGGTC TCGTACTAAA TATTAATCGG 4 620 

ATTGAGTGGA TATTTATACT CATTGCTATT GCATTAGTTC TCACTGTTGA AGCTTTAAAC 4 680 

ACTGCTATTG AATATGTTGT CX3ATTTAGTG ACCGTTGAAT ATCATGATTT AGCTAAATAC 4740 

GCTAAAGATA TTGCGGCTTT TAGTGTACTT ATAGTTTCAA TATTAGCATT TATTATAGGT 4 800 

TTAATAGTAT TTTTACCACA TTTTATAGCG TTATTTTAGG GAGGCATATA TGAGTTATCA 4 860 

45 ACCTCATTAT TTTCAAGAAG TTAGAAAAGC ACAACAAGAA TCATATTCGC CATACAGTCA 4 92 0 

ATTTAAAGTA GGGGCTTATT TAAAAmCGAA AGACgGTAGA ACTTTTTATG GTACCAATGT 4 980 

AGAAAATGCT TCTTATCCAT TATCGATATG TGCTGAACGA GCTAGTTTGG TATCGGCAAT 5040 

SO 

TTCTCAAGGA TACAGACCAG GTGATTTTGA ATCAAtAACT GTAACCGTAG ATGCAGATAA 5100 

ACCGTCATCA CCTTGTGGTG CATGTCGTCA AGTTTTGAAG GAATTATGTG ATGATGATAT 5160 
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AGTAAGCATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAAAAC AGTriTAAGC 


1 1^ Q A 


5 


AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 


X / 4 U 




ATTAGTAAAA ATATTGGTGC AGACTTACaA ACTGAACAAG CArTAGCAGA CAAAAATATT 


X8 00 




GCACAAGCAA AAGCTGAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGAAATGAAA 


1860 


10 


GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCArTAGCT 


J. ^ u 




ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAG ATTArTATAA TTTGAAAAAT 


X 7 o u 




ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 




IS 


GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 


2100 




CATATCAGTG ATCATTTCTA TCATTACTAC TATGCGCGAA AATAGTCATA AAGATAGACA 


^ X o u 




AAATCAAAAG CCACCTCAAA AAACATCTAC CGATAATGAA CCAAAAAAAG GTCGCTTTTT 


9 9 *? n 


20 


TGAAGAAATT GAGCGAACX3T TTAAAGAAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 






ATCATCGAAA CGAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 


!? 4 n 

45 -3 •* U 


2S 


AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 


^ 1 U 


ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 


^ *» O 




ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT CACTTAGAGA 




30 


TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA TTGCTAAAAT 


^ 3 O U 




GGAAAAACGT GCTAGAGATA TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT 


264 0 




GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCGAAAAA GATTTGACTA AATCAGCGTT 


2700 


3S 


CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC 


2760 




AAAACAATTA TAAAAl l 1 "IT TGAAAACAAG CACTATCGTA ATCGTAGTTG CrrGTTTTTT 


2820 




TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTTAAATT 


2880 


40 


ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 


2940 




AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 


3000 


45 


AATTGCATGA TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 


3060 


GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 


3120 




TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 


3180 


SO 


CCATGCAAGA GGACAAGaAG TTGCCX3TTAA AGGTACAAAA ATAGAAAACG TAGAAAAAGC 


3240 




GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 


3300 




AGATGTTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATTCAACATC TGTTAGATTT 


3360 



SS 
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(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

ATTTAAACGT TCAGTTTCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC GTGAATTTTA 60 

CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 120 

ATAATTAATA CCTCTGTTGA CTCCCTCAAC ACGAATATTA ATTATATAAA ACAAACATCA 180 

CAAGTTAGTG TCTGACACTA ATATGTGATG TTTTTTTGTT GTCAATTTTT AATTAAAAAA 24 0 

AGTTATATAG TTTATAAATA ATCAAATTGA TATTCTATAG GTTCTTATAA CTATAAAGTA 300 

TATTCAATTT CATGTATAAT TAATGTGAGG GCGAGGTGAA ATTGTGAGTT ATAATAATTT 360 

TTTACAAATG ACAACTATCT TGGAATCAAC GGCTGGAGAT ACTTGGGTTG AACAAGTTAG 420 

CAATATAATT GTTCAACCTA TTTTTACGTT AATATTAACC TGTTTGACAT TCTTAGGATT 480 

TGTATATCAA CTTTACTCTA AAAAAATCAA TGCAGCTGGT ATTATCGCCA CATTATCATT 54 0 

ACTTATTTTA TTTTTGGGAT TTCTAATCCA AGGAAATGTC AATATGCATT CTATCTTAAT 600 

ATTCTCAATT GGCGTTATAT TAGTTGTAAT TGAATTATTT GTAGTTGGTG CAGTAATTGG 660 

TATTATTGGC ATGATACTGA TAACTATAAG CATAACAACG CTCGGTGATA ATTTGCTATT 720 

TATGCTTGCG AATGTTATCG TTGCCTTGAT TTTAACGATT GTAGAATGGG TGATATTAGT 7 80 

GAAGATTTTC AACAGAAAGA TTCCGTTTTT GGATAAAGTT ATCTTAAAAG ATTCAACTAA 84 0 

TTCTGAGTCA GGTTACAATT CTCATGATAA CCGCTCGCAC CTCGTAGGAA AGACTGCTCA 9 00 

35 AACAGTTACA GATCTTCGAC CTGCAGGGAT TATTTTTTGT GAAAATGAAC GTATTGATGC 960 

TGTTTCAGAT GGCAACTTTA TTTTGCGCAA TAAAACGGTA AAAATCCTTG AAGTTGAAGG 1020 

AACAAGAGTA GTTGTGAGGG AAGTAGATTA ATTAAAAGGA GCGATACCAT GTTTAGTTTA 1080 

AGTTTTATCG TAATAGCAGT TATTATAGTA GTTGCATTAC TTATTTTATT CTCATTTGTA 1140 

CCCATTGGTT TATGGATTTC AGCGTTAgcA GCTGGCGTTC ATGTTGGTAT AGGTACATTG 1200 

GTTGGTATGC GTTTACGTCG TGTATCTCCA AGAAAAGTTA TAGCGCCATT AATTAAAGCG 1260 

CACAAAGCAG GACTAGCATT AACAACAAAC CAATTAGAAT CGCATTATCT AGCAGGAGGA 13 20 

AATGTTGACA GAGTTGTTGA CGCTAATATT GCTGCACAAC GTgcTGACAT TGATCTTCCT 13 80 

TTCGAACGTG CTGCTGCAAT TGaCCTTGCA GGACGTGACG TATTAGAAGC GGTTCAAATG 1440 

TCTGTTAATC CTAAAGTCAT TGAAACACCA TTTATCGCAG GTGTAGCAAT GaACGGTATT 1500 

GAAGTGAAAG CCAAAGCTCG TATCACAGTT AGAGCTAATA TTGCTCGACT TGTTGGTGGT 1560 
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GTTAAACAGT TAAAAGATAA AGTGTCTAAA 


ATTGTGATTC 




ATTTGAAGAT 


240 




GAAGCAAATG ATGTTCTTGT TAATGCTGTT 


TATGCAGAGA 




A 1 A rGG AAAT 


300 


5 


AGATATCGTA AGGACTATAG CAATGTTGAT 


AAGAGCTTAA 


ATGAAGCTGA 


ACGATTATTT 


360 




AAAAATAATC GCTATAAGCg TGCGATTGAA 


ATTGCAGAGC 


AAGCTCTTGA 


AAGTGTTGAG 


420 


10 


CCAGGTGTTA CTAAACATAT TGAAGAAGAA 


GTTATTAAGC 


AATAGAAACT 


AGTATGTAGT 


480 


TATACTTAAA TAATATGAGC ACTCTGTCAA 


ATTGGACTGA 


TGAGTTTAAT AATTGAAGTT 


540 




AGCCAACGAT ACGTTGTCTA GCTTCTTTTT 


TATATGGATA 


AATGaAAGGG 


ACAATAAATA 


600 


IS 


TAAATAGCAA TTGTTTAAAG ATAAACGTAA 


TCAAATGTGT 


TGTTTTAATT 


AATATAAGTA 


660 




GTGAAAAAAG CATAATCACA CAGCTGTTTA AATAGAGTGA 


AATAGTCTAA 


TTCTTATTTA 


720 




ATAAGTAGAA ATAAGATTAT 








740 


20 


(2) INFORMATION FOR SEQ ID NO: 403: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 



ATGCCAATAA TTAAACCATG 


TAATAAATAT 


ACGTATAGCG 


TACGACTACC 


AATATAAGTA 


60 


TATAATTTTT TCTTTGTTGA 


CATTAAATTT 


AGAAACGCAG 


TCATTGCGAT 


TAATATAATT 


120 


CCATATAATA TAAGTCGTTT AAAAGGACTG 


AATATACTCT 


GTCCTTCATT 


TTCAAGTGAA 


180 


GTATATGGTG AACTTCCCAA 


TAACCAATCT 


GCATTGATAG 


GATGAATCAC 


GTAAACGATA 


240 


AAAAACAAAA TAAAGGTAAT 


GATAGATACT 


GGTATTAGTT 


TTTTATTTTT 


AAAAATAGCC 


300 


GTATGTTTTT TGGTGAAAAT 


GTAACCTAGA 


TAAAATATTG 


GGAAAAATAC 


GATTGTCCTT 


360 


GAAATGCTTA AGTAGCTATC 


GATGTTATCT 


GAAAAACCTG 


CTCCAATAGA 


TATAATAATT 


420 


GAAACTGATA GCACTTTATA 


TGGATTAAAT 


CTTCTAACTA 


TTACTAAAAT 


GACATGAAAG 


480 


AAAAATAGCG TGATCAAAAA 


CCATAACGCA 


AATACTGGGT 


TAAAAGGATC 


AAGTTGTAAT 


540 


TCGTCACTTT TACCTGTTAA 


GAAATAATAA 


ATTGAAAAGA 


ATGCAAAAAA 


TATCATATAA 


600 


GGTACTATCA AACGTTTTGA 


AATTTTTTCT 








630 


(2) INFORMATION FOR SEQ ID NO: 404: 









(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6254 base pairs 
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(2) INFORMATION FOR SEQ ID NO : 401: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 
^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 
CCAGAATTAT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA 60 
CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaGCTG AAGATCAACA GTATTATAAA 120 
ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA 180 
AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTTTTTATT 24 0 

20 GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT 300 

ACAGACTTAA CTCAAATTTT AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG 3 60 

AAACTGAATA CAATGAAATA TATTAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC 420 
GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAT^AAT GATTTTTAAT ATTCCGATTC 4 80 

GCACTATTGA GTATAAATAT GGACAAGCGA TTTATGGAGT CGGAGCAGGT ATTACAATTG 540 
ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT 600 
TATAATGCAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA 660 
TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC 720 
ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGTGGTCAAT ATAGACTTAA 780 
AATCGTATTA AATGCTGAAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG 840 
TAGTTTT 847 
40 (2) INFORMATION FOR SEQ ID NO: 4 02: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 0 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



25 



30 



35 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 
TGAAGATGAA GCAGAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 60 
GTATCGTCGA TTACTTGCTT CTAACTTAAC AAGCGTTCCT GAAAGtTTAT CATCATGAAA 120 
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ATGTTTGCAG TTTATGTATC AAATAGTTTG TTTACGTTGA TTAATTTATC AATCACAATT 
CAAGCAATAA AAGCTGCACA CGGTGCGTAC TTAACATTCC CAAT7TTAAT TCTTATTATA 
GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 
AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 
(2) INFORMATION FOR SEQ ID NO: 400: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
,5 (D) T0POUX3Y: linear 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

20 ACACAATCTG AAGATTCACG TTGTGGTGCT GGACATGATC GAAAAATTAG AQCTC-AACAA 

ATGAnAGAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 

ATAGGTGGCG ACCTTAATGT CAATAAAGGC ACTCCAGAGT TCAAAGATAT GCTTAArAAC 

TTGAATGTAA ATGATGTTCT ATATGCAGGT CATAATAGCA CATGGGACCC TCAATCAAAT 

TCAATTGCGA AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 

GATAAAGATC ATAAACAACC AAAACAATTA GTCAATX3AAG TTGTGACTGA AAAACCTAAG 

CCATGGGATG TATATGCGTT CCCATATTAy aCGTTTACAA TGATTTTTCA GATCATTACC 

CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 

ACGAAGCGAG TTATATTGTT AAAATTTGAA TTGACTTACA TTTTAATAAA ATCATCTTAA 

CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 

GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTACCTACT ACATTCTATG 

40 TAGCAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 

ACATTATGTG TCTTCCAATC AACTTCATAT AATGCTGATA ATTTTTCTTC TTTTTTATCT 

ACATGGTTTT CACCAGACCA ATAGCCCCAG AAACCATGGC GATTCCAATC TATTTTAAAC 

45 

TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 

TGTGACATAA CAGCTAAAAA TTCTGGATTA AACCCTTCAG ACACAGTTAC AGGCATTTTG 

TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 

TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCGTATGGTC ATGTCCCATA 

TTATTTATCA AATGTGCTTC TACTTTCCAA CCTACACCTT TATGTGACGT AGATTGATCA 



540 
600 
660 
706 
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AATGTGGGGT CGTATAAGAG CTTGTAATGC TTTTTGGCGT AATTGATAAC TGCTTTAAAA 300 

CCTGCCCACA CAGATCGTGC ATAGTCAACA GAAAGACCTG CATCGTTTAA CAAATAATTC 360 

CTGAAAGCAG TACATTGCGT AGTAGTGATT TTGCCAATAG GGATATTTCC GAACCTTTCT 420 

TTTATGTGAG TATTATATTC TGTAGTTCGC TTTTCTATTG AGCGTGCAGA AAGATTTTCA 480 

TTTTTTAAAC GATCAAAAAA TATATATTCA AAGGGTTGAT TGTCCX5AGTA TCCATATTTA 54 0 

ACATTTTGTA TAAATTCGCT TTCAGCTAGT TTGGCATCTT TCTTACGTTC AAACCCACGC 600 

TTCATTTTTC GTTTGTTATT ACCGTATACA TCTTTATATC TAATGGAAAA ATACCATTTA 660 

CCTGTATTAT CATCCTTATA TACTGGCATT TTGCTTCTCC CTCCTCAAAA TTGGCAAAAA 720 

ATAATAAGGG TAGGCGGGCT ACCCGAAATT TAGTACTAGG TACTAAATGT GATATAATAA 780 

AATAAAAAGT AGGTGATGTT ATGACATTTA AAAACAATCA TAATTTCAAT GAATTAGTTT 84 0 

TAACGAATGA AGACATTAGA ATTTTAAAAA ATGTCTTAGA AGATGCAGTC AGTGTTTATG 900 

ATGAATATTC GGTATGTAAT GAAGAATCCG ATTTTGCTTA CTGTTTATTA AGAGACTTAT 96 0 

ATACATTAGA CAGCTTAGCT ATTTCGTCAA ATAATGTTTG AATTATCGAA TTGTACTCTT 1020 

CGATTTTAAT ACCATGCATA ATAGAGTTTC TGTGTTCAAT AGCAGCTTTG ACTGAATGTk 1080 

TTAAATGTTC TTCTATTAAA TCGTTGTTTk CCAtTTCGtk TAaAAATGTt CyTATATTCC 114 0 
T 

30 (2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 
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1141 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 99: 

ACTGTGGTAG GTTTTTTATT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60 

CTAATAACAC ATATTTAACT TGTATTCTTA AACTGGTATA ATAAATTTAT GTTGAAATGA 12 0 

ATATTGTATG ACAGGGTATT CACTTTTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 180 

AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAACGAATAA AGAAATTAAA 24 0 

ACAGGACGCT TTATTGCAAC TGCATCAATC GTATTCTCAA TATTATTGAT TATTCATTAC 3 00 

TTTGTTTCGT TGGATAATGC GACTGCCAAA GCATTACTTA ATTTAACGAA TCAAAACACT 360 

TCAGATAAAG CGATTGATTA CATTTTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 420 
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TATCATTGCC ATGTTTTCXA CTTCTTTCAA TAAAAAATAA AATCACTAAA TTGCTGCTTG 
AGCTTCACGT TTGTTAAGAT AACAATATCC GCTAGCAGTT tTGACTACAA AGCATATATG 
GaCrrrCACT ATCAAGTCGC CGCCCATGCC TTATATACAT TTAAAAnGAG CCTGAACAAA 
GTTCAGGCTC TCAATTTGTC CGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 
TGCTTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTCTGCCG TAGATAAGAA TCTACCTTCA 1320 
AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TGCAATAATT 
TCC 

(2) INFORMATION FOR SEQ ID NO: 397: 



1$ 



20 



30 



35 



Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 97: 
25 TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACAAT CITAACGGGT 

ATGATTAAGA ACCACGGTTT AATTaTTTAG TGAGATTCTT ATTTGGTGTA GGTOAGGCGC 
CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTAAAAAT GAAAAAGGTA 
GAGCATCAAG TGCATTATTA GCAGGATCAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 
CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTCCAGTAG 
GTATTTTAAT GGCkGTATtA TGGGCGATTA TTCCCAAAGA eCTACCTGaG CrACATAGwa 
TGGTTAATGA AGCGGAGAAA CGTTTCATTA TGGAAAATCG TGATATCGTA GCTAC 
(2) INFORMATION FOR SEQ ID NO: 3 98: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 
TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TTcGCCTTTT 
5^ aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG ACCAGAATAA 

AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGITCTAC TTTITCATCA 



1080 
1140 
1200 
1260 



1380 
1383 



60 
120 
180 
240 
300 
360 
415 



60 
120 
180 
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GCGATGAAGC AAATATTGAA AAATTATTAA ATAAGCTTGG TAATACAACT GATCGTCGTG 1020 

CGCAATTtGT TTGTGTCATA AGTATGAGTG GCCCTGATAT GGAAACAAAA GTATTTAAAG 1080 

5 GTACTGTTTC AGGTGAAATT GCAGATGGAA AATATGGCGA AAATGGTTTC GGATATGATC 114 0 

CGATATTTTA TGTACCGAAA TTAGATAAAA CCATGGCTCA ACTTTCAAAA GAACAAAAAG 1200 

GGCAAATTAG TCATAGACGA AATGCGATTA ATTTACTTCA AGCTTTTCTT GAAGGTGATA 1260 

AAAATGTCTA AATGGATTAT TGTGAGTGAT AACCATACTG AATCAGGCGT TTTATATCAA 132 0 

ATTTATGAAA TGCACCCAGA TGCAGATGTA TATTTACATT TAGGA 1365 



(2) INFORMATION FOR SEQ ID NO: 396: 

75 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 B3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 
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AATTCCTGGT 


GCAATAATAA 


ATAGGATGAA 


AAAGATTCGG 


AAAATATGAT 


AACTCGTAAT 


60 


CATAGCAACA 


TCGGCACCAG 


TAGCTAATGC 


AACTAAAACT 


ATCTGATTAA 


CCCCTCCTGG 


120 


TGCTGCACCA 


AGaAACAATT 


CATTAATAGG 


ATTATTATCA 


AAGAAATGTA 


TGATATAAAC 


180 


CATGATTAGC 


GCACCAATTA 


TCAACATAAT 


ATTTTGAATT 


GTAATTGCGA 


TTGCTAGTCT 


240 


ACCTTTTAAA 


TCTGACAATA 


AATGCGCAAT 


TTGAACTCCA 


ATTCTAATCA 


TATATATTAG 


300 


TTGTGCCATG 


TTCAACAACC 


AATGATCTAG 


TGTAAATGTT 


AAACCTGTAG 


AAAAATTCCA 


360 


AACAATTAAT 


ACAATGAGTG 


GTGCTAATAA 


TTGAAATGTT 


GGAAACTTTA 


TTTTAGACAT 


420 


AATTAGATAA 


ACTATAAAGA 


TAGCTATCGC 


TAAAATAACT 


ATTTGCCCTA 


TGTTTAATAC 


480 


TTGTGATAAA 


GGCAAGACTT 


TTGTTAACTT 


TCCATTCGCA 


TGCATGTTAC 


CATCATGAAA 


S4 0 


AAAATATGAA 


ATGAACGGTA 


CTAAAACAAC 


AACAAATATA 


ATTCGTGATG 


TTTGCGTTAA 


600 


GCTAACAACT 


AACAAATTAG 


CACGTTTGTC 


TTGTTCAGCC 


ATGACCAGCA 


TTTGTGTTAG 


660 


TGCTCCTGGT 


ATAACACTTA 


AAATAGCTGT 


TTCTGTATTA ATACGTGCAA 


•rmTTrAAA 


720 


AACAAATGCC 


ATTACTATTG 


CAATTAATAA 


TATCGAAATA 


GATACAACAA 


TAATCGAAAG 


780 


CCAATTGTTT 


TTAATATCCA 


TAACGACATT 


TTTCGTAAAC 


GTTGATCCGA 


TTTGCACACC 


840 


TAATAGTACA 


ATACCTAATT 


CACTAAGTAA 


GAATGGCCAT 


TTAATATCAA 


GTTTGAAAAC 


900 


TTTTACACAA 


ATGATTGATG 


CGATAATAGG 


ACCAAACATA 


AATGGAAGTA 


ATACGTGCGA 


960 
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GATTAAAACG TTTAGAAATT GAAGAAGCAC AAGCTC3GAGA TTTAATTCCT GTTTCAGGTA 
TGGAAGACAT TAATGTTGGT GAAaCTGTAA CACCACATGA CCATCAAGAA GCATTCCCAG 
TTCTACGTAT TGATGAGCCT ACTCTTCAAA TX3ACATTTAA AGTTAACAAT TCTCCATTTG 
CTGGCCGTGA AGGTGACTTT GTAACAGCAC GTCAAATTCA AGAACGITTA AATCAACAAT 
TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TCGGTAGTTG 
CTGGTCGCGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 
AATTACAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATCGTGTA ATGTCTGAAC 



900 
960 
1020 
1080 



--"-"w»vjni iv. i.«.v^tiArACA TGGGTAGTTG 1140 

1200 

1260 
CATTTGAACG 

'5 1270 
(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 
20 'B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

2^ (^i> SEQUENCE DESCRIPTION: SEQ ID NO: 3 95: 

AAGACCAGGA GAACAmGTAA AACAATATAC AGTTGaAATC GCTCGTAAAT TAATCGAATT 60 

TGATATAAAA TGCTCGTGAT TGCTTGTAAA TACGCAACTC CTGTnGCTTT AGAATATTTA 120 

CAAAAGACCT TATCAATCCC CAGTGATTGG CGTAATTGAc CAGGTGCTAG AaCAGCAATA 180 

ATGACTACTA GAAATCAAAA TGTATTAGTA CTAGGAACGG AAGGCACAAT TAAATCTGAA 240 

GCATATCGTA CGCATATTAA ACGTATCAAT CCACATGTAG AGGTACATGg CGTTCCCTGT 300 

CCAGGTTTTG TGCCACTTXTT AGAACAAATG AGATATAGTC ATCCAACAAT TACAAGCATT 360 

GTCATTCATC AAACACTGAA ACGITGGCGT AATAGTGAGT CTGATACTCT CATTTTAGGA 420 

TGTACCCACT ATCCATTGCT CTATAAACCT ATCTATCATT ATTTTGGTGG TAAAAAGACA 480 

GTGATTTCGT CTGGATTAGA AACGOCTCGT GAAGTTAGTG CATTCCTAAC ATTTAGTAAT 54 0 

GAACATGCAA GTTATACTGA ACATCCAGAT CATCGATTTT l-PGCAACAGG TCATCCTACT 600 

CACATTACTA ACATTATCAA AGAGTGGTTA AATTTATCTG TCAATGTGGA ACGTATATCA 660 

GTGAATGACT AGGAGGATTT TTAATGAAAG AGATTGTTAT TGCATCGAAT AATCAAGGGA 720 

AAATAAATGA CTTTAAAGTA ATATTTCCAG ATTACCACGT AATAGGTATT TCAGAACTAA 780 

TACCAGATTT TGATGTGGAA GAAACAGGAT CAACATTTGA AGAAAATGCT ATATTAAAAT 84 0 

CAGAAGCTGC TGCAAAAGCA TTGAATAAAA CGGTCATAGC TGATCACAGT GGACPAGAAG 900 
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AGTGTAAAGT TTTCAACATA ATACTATTAG TTCGGTCATG TATCGGACTG ATGGAAAAGC 240 

GTTTCACTTT TAATGACTCA TTAAGAACGG CCTGAAAATG TTTGGCGTAT TAAGTGCAAT 300 

^ GATAGTTTTG ACATTTAGTT TCTAATTGGT CATTACTGCC GAGCAAATCT AGTAGAGTAA 360 

TCATGTAAAT CTTTAATGTG CCATTTGATT CACTAGCGGT GTTAATAACT ACGGAAATTG 42 0 

CATTTCCGAC TGAAATTTTT GAAAAATATC AACGTACGCT ACAAATAAAA TTTTAAACTG 4 80 

10 

TTATAAATGT GTCTCAATTT CATATGTTCA TCGACGATAT GAAGCGTATT ATGGTAAAAT 54 0 

GAAGAAATAA TAAACTTGTT AATAAATAAA ACATCACGAT TTGACTAAAG CACTTTATTA 600 

TTGTGTAGAT AATAGTTTTT TAACGAAATA AAAATGGCGA CTGGTTTTAA TAAATCAGCT 66 0 

IS 

AATGAATCAC TACACCTATA AGTATGAATA TAGTGATTAG AATGCTTTGT ATAGTTGGAT 720 

TTTGCAAAAT TGATGTTA 73 8 
20 <2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
^5 (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 
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AAAAGTTGTA 


ATTAAAAGTG 


GGATTTTACT 


TAAGnnAGAA 


GGAAACTATT 


TATATGACTA 


60 


ATAAAAGAGn 


AGATGTCCGC 


AATATAGCAA 


TTATTGCTCA 


CGTTGACCAT 


GGTAAAACAA 


120 


CTTTAGTAGA 


TGAGTTGTTA 


AAACAATCTG 


GTATATTCAG 


AGAAAATGAA 


CATGTCGATG 


180 


AACGTGCAAT 


GGACTCTAAC 


GATATCGAAA 


GAGAGCGTGG 


AATTACGATT 


CTAGCCAAAA 


240 


ATACGGCTGT 


TGATTATAAA 


GGTACACGTA 


TTAATATTTT 


GGATACACCA 


GGACATGCAG 


300 


ACTTTGGTGG 


AGAAGTAGAA 


CGTATTATGA 


AAATGGTTGA 


TGGGGTTGTC 


TTAGTAGTAG 


360 


ATGCGTATGA 


AGGTACAATG 


CCTCAAACAC 


GTTTTGTACT 


TAAAAAAGCG 


CTAGAACAAA 


420 


ACCTGAAACC 


TGTTGTTGTT 


GTTAATAAAA 


TTGATAAACC 


ATCAGCACGT 


CCAGAGGGTG 


480 


TTGTAGATGA 


AGTTTTAGAT 


TTATTTATTG 


AATTAGAAGC 


AAACGnTGAA 


CAATTAGAAT 


540 


TCCCTGTTGT 


TTATGCTTCA 


GCAGTAAATG 


GTACAGCTAG 


CTTAGATCCT 


GAAAAGCAAG 


600 


ATGATAATTT 


ACAATCATTA 


TATGAAACAA 


TTATTGATTC 


ATGTACCAGC 


TCCAATTGAT 


660 


AACAGTGATG 


AGCCCATTAC 


AATTTCCAAG 


TAGCATTGTT 


GGACTACAAT 


GATTATGTTG 


720 


GACGTATTGG 


TATTGGTCGT 


GTATTCAGAG 


GTAAAATGCG 


TGTCGGAGAT 


AATGTATCAC 


780 



ss 
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TGGCACATAA TGTTAAAAAA GGTATTATTG CTTGGGGTGA 


TGATGAACAT 


CTACGTAAAA 


1260 




TTGAAGCAGA TGTTCCAATT TATTATTATG GATTTAAAGA 


TTCGGATGAC 


ATTTATGCTC 


1320 


5 


AAAATATTCA AATTACGGAT AAAGGTACTG CTTTTGATGT 


GTATGTGGAT 


GGTGAGTTTT 


1380 




ATGATCACTT CCTGTCTCCA CAATATGGTG ACCATACAGT 


TTTAAATGCA 


TTAGCTGTAA 


1440 


10 


TTGCGATTAG TTATTTAGAG AAGCTAGATG TTACAAATAT 


TAAAGAAGCA 


TTAGAAACGT 


1500 




TTGGTGGTGT TAAACGTCGT TTCAATGAAA CTACAATTGC 


AAATCAAGTT 


ATTGTAGATG 


1560 




ATTATGC7VCA CCATCCAAGA GAAATTAGTG CTACAATTGA 


AACAGCACGA 


AAGAAATATC 


1620 


15 


CACATAAAGA AGTTGTTGCA GTATTTCAAC CACACACTTT 


CTCTAGAACA 


CAGGCATTTT 


1680 




TAAATGAATT TGCAGAAAGT TTAAGTAAAG CAGATCGTGT 


ATTCTTATGT 


GAAATTTTTG 


1740 




GATCAATTAG AGAAAATACT GGCGCATTAA CGATACAAGA 


TTTAATTGAT 


AAAATTGAAG 


1800 


20 


GTGCATCGTT AATTAATGAA GATTCTATTA ATGTATTAGA 


ACAATTTGAT 


AATGCTGTTA 


1860 




TTTTATTTAT GGGTGCAGGT GATATTCAAA AATTACAAAA 


TGCATATTTA 


GATAAATTAG 


1920 




GCATGAAAAA TGCGTTTTAA TATGTTTATA ATAGAGTAGT 


ATGGGTATTT 


ATTATTAATG 


1980 




ACATTATTAC ATGTTAATTA GGAGGCGTTT TTAATGGATT 


GGATTTTACC 


AATTGCTGGA 


2040 




ATTATCGCTG CGATTGCATT CTTAATTTTA TGTATCGGTA 


TCGTAGCTGT 


ATTAAATTCT 


2100 


30 


GTTAAGaAAA ACTTAGATTA TGTTGCAAAA ACACTTGACG 


GTGTAGAAGG 


TCAAGTTCAA 


2160 


GGTATTACTC GTGAAACAAC AGATTTACTT CATAAAGTAA 


ACCGTTTAAC 


TGAGGATATC 


2220 




CAAGGTAAAG TAGATCGTTT AAACTCAGTT GTAGATGCTG 


TTAAAGGTAT 


CGGTGACTCA 


2280 


35 


GTACAAACGT TAAACAGCTC TGTAGATCGT GTAACAAATT 


CAATTACACA 


TAATATTTCT 


2340 




CAAAATGAAG ATAAAATCTC ACAAGTTGTT CAATGGTCAA 


ATGTTGCAAT 


GGAAATTGCA 


2400 




GACAAATGGC AAAATAGACA CTAC 






2424 


40 


(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 









45 (D) TOPOLOGY: linear 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 93: 
CnATAATATT CnTCAAnCCT TTPJKATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 6 0 

GTGGGTAGTT TTTTTAGTTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGGCAGA X20 
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20 



25 



30 



40 



4$ 



2407 



GGCTACTATG AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCXSA ACAAGACATT 2280 

GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 234 0 

GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTGTTAACA 24 00 
ATTAACG 

(2) INFORMATION FOR SEQ ID NO: 3 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 

ATGATGnATG GCnCCGCCAA GAAGTTGAAC CAGTCTATTG GCTAGAATTG CTCAAAAAGC 60 

GAGAcaTGTG GTATTCaTAT GTTaGTAgCt ACGCAAAGAC CATCTGTCAA TGTAATTACA 120 

GGTTTAATTA AAGCCAACAT ACCAACAAGA ATTGCATTTA TGGTATCATC aAGTGTAGAT 180 

TCGAGAACGA TATTAGACAG TGGTGGAGCA GAACGCTTGT TAGGATATGG CGATATGTTA 24 0 

TATCTTGGTA GCGGTATGAA TAAACCGATT AGAGTTCAAG GTACATTTGT TTCTGATGAC 300 

GAAAT-TGATG ATGTTGTTGA TTTTATCAAA CAACAAAGAG AACCGGACTA TCTATTTGAA 360 

GAAAAAGAAT TGTTGAAAAA AACACAAACA CAATCACAAG ATGaATTATT TGATGaTGTT 420 

TGTGCATTTA TGGTTAATGA AGGACATATT TCAACATCAT TAATCCAAAG ACATTTCCAA 4 80 

ATTGGCTATA ATAGAGCAGC AAGAATTATC GATCAATTAG AGCAACTCGG TTATGTTTCG 540 

AGTGCTAATG GTTCAAAACC AAGGGATGTT TATGTTACGG AAGCAGATTT AAATAAAGAA 600 

TAATTATGAG TAAGGAGTTT TATATAATGA CACACTATCA TTTTGTCGGA ATTAAAGGTT 660 

CTGGCATGAG TTCATTAGCA CAAATCATGC ATGATTTAGG ACATGAAGTT CAAGGATCGG 720 

ATATTGAGAA CTACGTATTT ACAGAAGTTG CTCTTAGAAA TAAGGGGATA AAAATATTAC 780 

CATTTGATGC TAATAACATA AAAGAAGATA TGGTAGTtAT ACAAGGTAAT GCATTCGCGA 84 0 

GTAGCCaTGA AGAAAtAGTA CGTGCACATC AATTGaAATT AGATGTTGTA AGTTATAATG 900 

ATTTTTTAGG ACAGATTATT GATCAATATA CTTCAGTAGC TGTAACTGGT GCACATGGTA 960 

AAACTTCTAC AACAGGTTTA TTATCACATG TTATGAATGG TGATAAAAAG ACTTCATTTT 1020 

SO TAATTGGTGA TGGCACAGGT ATGGGATTGC CTGAAAGTGA TTATTTCGCT TTTGAGGCAT 1080 

GTGAATATAG ACGTCACTTT TTAAGTTATA AACCTGATTA CGCAATTATG ACAAATATTG 1140 
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TATGGCArrC GAGGATTATG TGCAGGTTTA ATTTCAGCTA TTCTTTCXSAA GAAAATCTCA 
ACTAAAGTAT TGATATTTTT GTTATATTTC ATATTAGTCA TAAATTCAGC ACTATTTATT 
^ TGGATAAACT CAGCATTTTA CTTATTCATA GGATCATTTA TACTAGGCTA CTCAATTTCA 

TCAATCAGAA TTTATATGAA TACAGCTATA ATGAACACTC l-TTCAGATAA ATATGTCGGT 
CGCTCATTTA CGATATGGAC GTCAATTTCA TTGTTACTGC AAAGTTTAAT TGCTCCATnT 

10 

TTAGGAAGAT GGATTAATGA AATTAATGAT AAATTCGGTT TCTATAITAT ACTCATTTTA 
TCCTTACTCA TATTTGTaCA CTGCTGCTTG TTAACAAAAC AGACAAAATA AAATATGCAC 
ATAAAGAAGA GTGACCGTCA CTCTTCTTTA ACAAGCGACC ATTTATCGaT GGGCTTAGTT 
CTCTCTGCAC CCACACTTCA CTACTTCACT TTTTCAAATC ATTTTTTnTG GTCTTAAATA 
AATCAGTGAG ATTTGTTGCT TCGGTAAAGT CTAGAATTAA TATCATTTCT rrAGAACCTXS 
GATATGGCGA AACTAATGTA TTATCTTGCA ATTTCTGCTG GGCACTTTTA GTCGCCTTGA 
CCAATAATCT ATTATCATAC AAACCACCTA TAACCACGCC ATCATAATAA ATAATATATT 
CTCCCATCAT CTrrCTTGTC TTAACCGCGT TTGAATTCAC ATGATTTAAA AATAAATCAT 
2S GTACATCTTT CTTAGTCGCC ATTGTAATCG CTCCTTCAGT TTTATGnTA ATCACATTCG 

TATTAATGAT TCATTTCX3TG TTGCTCTTAA TTTTATCTAT AATTATATGC GTAGTTAAAA 
TCAAACCTAT GGAAAAGAAA ATAATGATAA CAATGTTAAG AAATATAGTT ATAAAATTAT 
AGTTTGGAAG GTATGCGAAT AGCAGAAAAA TAGGTATCGC AAAAAATAAA TCCCACCAAC 
CTAAACTTTT TAAAGAATGC TTTAAACCTT CCATAATATC ACCTTTATAA ATTrGTCTTT 
GTTATAAGAT AACTAAAAAA TCGCTTTACT GTAAAAGTAG CCAAAGAAAA TTCTGAATCA 
TATTCATAAG TAGTGTATCA TTAATAATGA ACAATTTAAT ACTATAATCC TTGATCrTTG 
TATTGATCAA CTTACCACAA CATTTATTTT AGACTACTCT TAGACTTCCC TTTCAAATGG 
TTGCATCTAT TGAAATTCCT TTTGTATAAG TTAGGCTTTT GTGGTAATAT CATCATGCAT 
AAAAAATCGA GATACTAATT ATAAAGAGGG TATAAATATA TTATCAAAGA AAATTTTTGG 
AGTGAATTAC CACGTCCATT TTTTATTTTG GCGCCAATGG AAGACGTTAC AGATATCGTC 
45 TTTCGACACG TTGTAAGTGA AGCAGCTAGA CCGGATGTGT TTTTCACTGA ATTTACAAAT 

ACTGAAAGCT TTTGCCACCC TGAAGGCATA CATAGTGTGC GCGGACGCTT AACTTTTAGT 
GAAGATGAAC AGCCGATGGT CGCTCATATA TGGGGAGATA AGCCAGAACA GTTCCGTGAA 
ACGAGTATTC AATTAGCTAA AATGGGCTTT AAAGGCATAG ACTTAAATAT GGGATGTCCT 
GTAGCAAATG TTGCTAAAAA GGGTAAGGGT TCCGGCTTAA TCTTAAGACC TGACGTTGCT 
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AGGTATCAGT CAAACTGAAT ATCAAATAAA ACCTGGTAAT ATAACAAGTA ACTCTGAAGA 84 0 

AACAAGTTCG ATATCTAAAG TGAGCTGTGA AATATAGGTA GCCATTTCAA AAAATTTAAA 900 

GGTGAATTTG ATAATGTAGC TCAAGGAGAT TGGGTTAAAA AGGCGAAGAA TGAACTGGAT 960 

GATATTAGTA AGAAATTAAA AAATATTCAA AGAACGGAAG TTTAATAGCT TATATGATTC 1020 

TTGGAGCTAA GACAGCATGC GTTCATTCAT GCCATTATTA ATATAAGCAC CGCAACAAAA 1080 

AAGCTTCTAA TGTGATACAG GAACCTCATA TTCCGTATCA TGTTAGAAGC TTTTAATGTC 114 0 

TAAAGAACAT CTACATTTTA TCATATTTTC TGACTTATTA AACTTTTATA TAATTAAATA 1200 

TTTCTTAATT TTCCAAAATA GTGATAAATT TGTGAAATAC ATCACAAATC CCTTTATTTA 1260 

TTTGGAAATT CATGTAATAT TAGACTTGTA AGAAGTTAAT AAATAGAGAG AGACGAGAGA 132 0 

GTTTATATAA ATACTATATA AACATTGGAG TGATGATTAT GAGAAAAGAG ATTGAAGCGC 1380 

TTATTTTCTC AGACGTAAtA GCTATGATAT TTACGTGAAC ACTGGTGTAA ATCAAGGATT 144 0 

AATTGGTGAC ATCAAAGATG GTTACCTAAC TATTGATTCT ATGCCTTACA TTGATGCTGA 1500 

GCGTTTGTAT CACTTTGCTA TGGAACGTAA ATCGTTAGTC ACTAACTAGT TCTTATTGCC 1560 

AATGATTACT ACCCCTAGTC GGCGGCAATT GAAGTGTGAT TGATGTAACT TGCCCTCGTT 162 0 

GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT TAAGTAAACC CTAACTCCCC ACAAATCTGG 168 0 

AACGATACTA AAAGCCACGT CCTATATTGG ATGTGGCTTT AGTCAkACTT ATATTATTTT 174 0 

30 tAAAACGATT ACCTACAAGA TTTACATATA AAATTCTATC ATGnCTGC 1788 
(2) INFORMATION FOR SEQ ID NO: 3 91: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 07 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 60 

GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TATCAGGAAT ATTATATAAA 120 

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAGCATTTTT 180 

ATGTTATTTA GATTGCATGT AGATAAACCA ACTCATTCAG AGGAAGAATC AACAAATAGC 24 0 

TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 3 00 

TITGGAGTTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 360 
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CGATTGTCTT ATACGTGTCA GTGTTAATTC AGATATTTCC TGTGGAATAT ACCACTTATT ego 

AATCATAATT GGATAAGGTG tTTGTGCGTA CAGTGTTTCA ATAATCAGCC AACAATGTGT 720 

ATCACCaTCA AACACGTGAC TATGATTTTk GAAGTGGGGC GCTTTGGTAA TAGACATTTT 780 

TAAATCTGAT TGATATGCAT TGCTATAAAT CGTTTGCTCA ACGAATGTCT TCATGTCGTC 840 

TTCGTTTTGT GTATTCACTT TAAATGTGTC AATGACATTT AACGGTATAA AGGTAAAGCA 900 

AAATGCATCA GCTTGCTTAG AATGATTGTC CrrrTTTTTGA TAATAGCGTT CCATTGCAAT 960 

GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA J^TCAACACG 1020 

ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 1080 

TTGATCCATA CTTGAAGCGT GTTGGGCATT TGTTTGTGGA ATAACAAAGT GTCCCTTCCC 114 0 

TCTTGTACTC TCTACGATGC CATCTTCGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 1200 

ACGACTGGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 124 9 
(2) INFORMATION FOR SEQ ID NO: 390: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 90: 

AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 60 

GATTGTAGGT TATGTCATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120 

ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180 

AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 24 0 

AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 300 

TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATGACAGTAA 360 

CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA ATAAAATCAT 420 

45 ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 4 80 

TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 54 0 

TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 6 00 

ACAGTTaAGa AAAAGATATG ATATTCCAAC CAAACAAGCG CCGAAACTAT TATTGAAAGG 66 0 

CGATGGAGAT TTAAAAGGTT CATCCGTAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 720 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3B8: 

^ TCATCCAAAT TTTGGAAATT CCACATTTTA CATATCGTAA TTTTTTAGGA AACTAGTGAA 60 

TATAACAAAT CCCTCCTCTC ATTTTTAAAA TAGATATATC ACTTCCCCAC TTTTACTTAA 120 

CTAAACTGCA ACGGTTCCTA ATACCAAAAT CCTGCCCTCT ATTTTTATCA ATTCAAGCAT 180 

10 ACTTATTGAA AAATGTTAAC GTTTTCTTGA TAATCATTGT AAGCGCATTT ATTTTATAAA 24 0 

CTAACGTTTG AAATATACTA CAGGAGTGAC ACGTAATGAC TCAAATTACT GAAAAAGAAT 3 00 

TAAAAAAGAA GTATTTAGAT TTACTATCCC AAAATTTTGA TACTCCAGAA AAACTTGCAA 3 60 

CTGAAATTAT CAATTTAGAA TCAATTTTAG AATTACCTAA AGGTACGGAA CATTTCGTCA 420 

GCGATTTACA TGGTX3AATAT GAAGCTTTCC AACACGTATT ACGCAACGGT TCTGGGAACG 4 80 

TGCGAGCGAA AATCAATGAT ATTTTCAAAG AGAGACTTTC AACTAAGGAG CTTAATGACT 54 0 

TAACTGCTCT TGTCTACTAT CCAGAAGrCm AATTtAAAAT TgATTAAAAG TGATTTCCAA 600 

AATTgCGGtC mACtTAATGt CyGGtATATC ACaACmATCG aACATTTAAT TGAGTTAATT 66 0 

AAATATTGT 9 

(2) INFORMATION FOR SEQ ID NO: 3 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 9 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CACATGGCTG TTAGAGATAT GAATGGCCAT GCGTTACCTT TAACAAAAGA TGGCAATTTT 60 

TATCAAACGA ATGTAGATGC AAATGGTGTT AATCATGGTG GTAGTGT^AAT GGTGCAAAAT 120 

AAAACAGGTC ATATGAGTCA ACAArGCCAT ATGAATCAGA ACACACATGA ACCAACAGCC IBO 

ACACATGCAA CAAGGTCATA TGCAATCATC AAACCATCAA ATGATGAGTC CAAAAGCAAA 240 

TATGCATTCA TCAAATCATC AAATGAACCA AAGTAACAAA AAAGTTTTAC CAGCTGCTGG 300 

TGAAAGTATG ACATCAAGTA TTCTTACTGC AAGTATTGCC GCACTACTAT TAGTATCTGG 360 

GTTATTCTTA GCATTTAGAC GACGTTCAAC AAATAAATAA ACATAATACG ATTAATAATA 420 

GAAAAATCGT GTGATTATCT GaGGGAGCCT AGGACATAAA TCAATGTCCT AGGCtCnCTA 4 80 

AtGTTATATT GGCAGTAGTT GACTGAATGA AATTGCGCTT GTAACAAGCT TTTCCATTTC 54 0 
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GGTATGACGT TAAAGGAAaC GCAGTTCAAA AACAATTTGT GAAAGAGCAT GGTGAAAAaG 
TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAATTAA GGAAGGTAAA AATGAACCGA 
GCTCATTTGT AATATCGCCT TTTTCAGCAG TACAGCAACA GATTAAACGT ATGTTAAAGC 
AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GTCGATAAAT 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



900 
960 
1006 



600 
660 
720 
780 

CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT rrTGTAATAG 84 0 

GTACTGATAA TACCCAAGAT GGTGCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 
ACGTTGCAGT GACAAGAGCT AAGAAAGAGT TTaTGTAATT GGCGACATGC AAAGAATACA 
GATGAaACCA TTTTATGAGA CGATTTTTAn AGnAAGAAAT GTAAAA 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 
CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 
GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTTATATTA TCAGCAGATT 
ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATGAGAGT ATCAAAGAAG 
CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 
CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC AGTTAACTTT GGAATTCTTT 
ATGGGATAAG TGATTATGGT TTAAGTCAAA GTTTAGGTAT TACTCGTAAA AAAGCAAAAG 
CATTCATTGA TGATTATTTA GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 
40 TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTGCT ACATCGTCGA CGCTATATTC 

CTGATATTAC GAGTCGTAAC TTTAATTTAC GCGGCTTTGC TGAACGTACT GCTATGAATA 
CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 
AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATIT 
TT 

(2) INFORMATION FOR SEQ ID NO: 3 88: 



60 
120 
180 
240 
300 . 
360 
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480 
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CAGTTTTACG 


TCCTACACCT 


GCTAAACTTT 


CTAATTCCTT 


ATGTGTTTGT 




240 




CATTAAATTG 


ATCAATCAAA 


GATTGACAAA 


GTTTCTTAAT 


ATTCTTAGCT 


TTGTTACGAT 


300 




ACAGACCGAT 


AGAACGAATA 


TCATTCATAA 


GTTCTTCATC 


ACTG ACTGC C 




360 




CAGGOj'I"!^!"!' 


GTA'l~l''i''l"i"i:A 


AACAGCTCAG 


TTGTTACTCT 


ATTTACTAGA 




420 




ATTGCGCTGA 


CAATAATACA 


GCAATAGTTA 


ATTCGAACGG 


ATTATCATGT 


TTTAATTCAC 


480 


10 


ATTCTGCATC 


CGGAAACATA 


TTTGCTATAA 


CATCAATCAT 


TTCTAATGCT 


TTTTTCTTAC 


c A e\ 




TTACCATCAA 


GGTTCTCCCC 


GTTTAACCAA 


TCAAATTTAG 


GTACCGTTTT 


AACTGTGTG C 


600 


IS 


GTCATTTTCG 


GTTTATTGAA 


TTTTTCTCTT 


ATTTTTCTAG 


AATCGTCAAT 


TGTTTTGACA 




TTGTTTTTCT 


TCCAATTAAG 


TAAAATACX3A 


TCTATATATT 


TAAAGCTAAG 


TTTATTCAAA 


720 




CTATTCGCCT 


CGTCTAATGC 


CGCTTGTATA 


ATTGCAGTAT 


CGTGTTTATC 


AACATCAATC 


780 


20 


CATTGATTTA 


ACGTTTCTAT 


TTCATATGGA 


GATAACGGCC 


TTGCAAATGT 


ATCCTCTAAA 


840 




ACTCTAAATA 


ATTGTTTAAA 


TTTTTCTTTA 


CTATTTAGCT 


CTTTCGTTTC 


CATACTTTGT 


900 




TGCTTCAATA 


TATGACTTAA 


TTTTTCGAAA 


AAAGGATCTA 


GATTCATATA 


TTCGGkAAAT 


960 


25 


CTACCTTCyT 


CATCyTTTTG 


aACTkGtAAT 


tCTAGCAATT 


CACGTgTATC AAATTTTGGA 


1020 



TACCATT 1027 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 
AAGGnTTGGA GGGAATTAAT nGATGGCAAA TCCCAGaTAA AGTACACaAC GCATGGAATG 60 

40 

TGATGCATTT AATATTTCCA GTAGTTAGTA CGACGTTTGC AAGCTTTAAA TCTATGTATG 120 
GGGGCATACC AAAAGATTTC ATAGACTACT TATTTATTGA TGAAGCAGGA CAAGCAATAC 180 
45 CTCAAGCAGC TGTGGGAGCA TTATATCGTT CAAAAAAAGT TGTAGCTGTA GGTGATCCGA 240 

TTCAAATAGA ACCGGTTGTG ACTTTAGAAA GTCATTTAAT TGATAACATT CGTAAAAATT 300 
ATCATGTTCC GGAATATCTA GTTTCTAAAG AAGCTTCTGT GCAGTCTGTT GCAGACAACG 360 
CCAATCAATA TGGTTTTTGG AAATCTGATG CTACTGATAG TAATCAAAAA ACCTGGATAG 420 
GCATACCTTT ATGGGTGCAC AGACGATGTT TAAAACCTAT GTTCACGATa GCTAaCCAAa 4 80 
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(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiable 

(D) TOPOLOGY: linear 



10 





(xi ) SEOUENCE npcinpTii'T'TO'M . 


SEQ ID NO : 


384 : 








TGCACTTGAT ACTTTAGCAP AmaaATO^r'r' 


TAATCGATTA 


GTTGCAAATG 


ACAAGAATGA 


60 


IS 


AGCAACTTTG GAAATGACT A a Ta ana Tr^r* r« 


AACGATTCGT 


TTTACAGAAC 


CTACGCTGAT 


120 




TGCATTAGCA GGGGGTAATT: Tnaaa^r'^pTA 


CACTG AG CAT 


ATGACTATAT 


CTCCATATAA 


180 




ATTGTATTTG TTAH A T a a a r* n r*r* a Tv^T^r^rn 


AAAGTTTAGA 


GAAACAAGTT 


ATACATCGCG 


240 


20 


AGTGTATTTA GCTGTaGGAG GCf^nATT^r^a 


Al lAGATGCA 


TGGTTAGGAT 


CTAACTCAAC 


300 




CGACTTTAAT GTAAAAATTr* r"rr!i^'r*r*T*a^K 


AGGTAGAACA 


TTACAAGATG 


GCGATGAAAT 


360 




MMALiuxxAAU AGAGATTATA CAGCTCGTCA 


TCATAAGTTA 


TTTGAAAACC 


TTGCTCACAC 


420 


2S 


GAAACAAACA GATTGGGGTA TTGATGGATA 


CGCCTTGTCA 


TTTAATTATA 


TGTCTGATGT 


480 




ATTTCATGTC GTTAAAAATA AAGGTACGGA 


AGATTTTAAA 


GAAGATGCCA 


TTCAAAGATT 


540 


30 


TGTGAAACAT GATTATAAAG TAACGAGCAA 


AGCAAATCGC 


ATGGGGATGA 


TGCTTGAAGG 


600 


TGAAAAAATC AAAGCTTTTT ATGAAGATAT 


GCCACCGTAT 


CAGACTGTCA 


AAAAAGGAAC 


660 




GATACAAATT AAGCGTGATG GCACACCTAT 


TATCCTATTA 


AATGATCATT 


ATACGCTAGG 


720 


3S 


TAGCTACCCG CAAATCGGTA CAATCGCAAG 


TTATCATTTA 


ACGAAATTAG 


CACAAAAACC 


780 




GCAAGGATCA CGTTTGAAAT TTCAATTTAT 


AGATATTTTA 


ACGGCTGAAA 


AGAACCTTGT 


840 




TAAgTATAGT 7UVCTGGTTAA AC 








862 


40 


(2) INFORMATION FOR SEQ ID NO: 385: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

50 

AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA SATAAACGGT 
CTTCTACCTG TCTsACATTA TCTTTCCAAC GATTAATACC TAAGCGTTTA GAAACACGTT 

55 
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(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1143 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382; 

GGACCACCTT TTTTACCTGT ACGACTAAAG CCAACTGTTT TAAAGGCACC AACAGCATGC 6 0 

CACACTTGAA TAATTTCTTG TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA 120 

TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 180 

CATTTGCGTC TATCCGTAAT ATTCTCCTTA AATACCGTTT TAATATCATA ATCAAAATCT 24 0 

ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT TCCCTGATAA ATTCGGACGC 300 

GAATCTGATG TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 360 

25 AAAATAGCTT TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTGTTTaTGA 420 

AGCCAGTGTG cACTTgTCGC AATGaCCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 48 0 

TCAAATACAA ATTCGTTAAC GTCACT 506 
(2) INFORMATION FOR SEQ ID NO: 383: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

AGCATCTGCA AtGAGTcTCT AATACAACGA TACGTTTTGC ATCTTTAGGT ACTTTTACTG 60 

TACCATTTTC ATCTTTTACC GAAATAGTAT CTTTAGTTGA TGATTCTTTT TTACTTGAAT 120 

TATCCGTATT ACCACAAGCT GCAACTAAAA GTAAGGCAAC TATTAATCCC AATATACTAA 180 

AAGTTTTTAG ACCTCTCATC GTTCCACTCC TTAATATGTA TAACTTCATT TATTATTTTA 24 0 

TTGATAACAA TTATCATTGT CAAGTAGCGT TCAATCTTTT TTATATTTCT AAAATGTATG 300 

ACTATATATT TCCTCTAATA ATTATGACTA CAATTAGCAC ATTTCCTTAG ACAAAATACT 360 

GATAATGTAT CATTGCTATA TCATCTTTGC ATTAATACAA TTGACACCAT TTAGCATGAC 420 

55 
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TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC TAAAGTTCTT AGAAAAGPJKA ' 204 0 

TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT AAAAGAGGCA GGCGATAAAT 2100 

AATGGTAAAA GTAGCTAACA AGTATgcTAA AGCATTATTT GACX3TGTCAT TAGATACAAA 2160 

TAATTTAGAG ACTATTAATG AAGAATTAAC AGTTATAAAT GAAGCAGTAA AAGATAAAAT 2220 
TG 

(2) INFORMATION FOR SEQ ID NO: 381: 



2222 



60 
120 
180 
240 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



20 ^^^^ SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

TAACAATTCC TaTATTCATG TTTAATACGA AACACTACAT TTACATTGTA ATTCACTATC 
TTTTGAAGTA ATAAAGTGAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC GATTGTAACA 
25 ATTATATGCA AGTTTTCAGG ATTAACACCT AATCTTTGAA TGATTTGTTT AATAGTATAA 

TAATCCATCC AATAAAAATT ACTTCCTTTA ATATAAATGT TTTTAGGTTG ATACATTTGA 

CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA TACTTGAAGA ACTTTGATTA 3 00 

GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA CTAATTGGGA ATGGTTAATT 36 0. 

GAATGAGATG TTGGACTATA TCTTTTTCTA ATTT^TTTAG GAGTGTTACC ATACGTTTCT 4 20 

TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT GACTACTTGC GATTTCTTCA 4 80 

ATAGGCTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT CCAGTCTAAT ATGATTTAAA 540 

TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT CACTTAAAGC TTTGTTTGTA 6 00 

ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT TATGGTGGTT GTTTGTTAAA 660 

TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT CGTATGTATT TAATGACATT 720 

AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT TATAAGACTG TTGTTTAATT 780 

TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC TAGCAACCTC ATTTGTTAAT 84 0 

ATCTTGCCAC GCATCATATA GTTTGCATTC ATAAACTTCA AAAATGTCTT TGCACTTACT 9 00 

TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT GATACAAATC ATTCAACATA 960 

ATAATGGTGA CATCATTTCT TTTTACATCA TATTGTTGTA GATTGATTGA AACTACACCT 1020 
GTACCTTGTA GCCAATATAT AATTTTtAAG TTCGCTTCCT TCACACTACT CATTTTCATA 
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ATGCTATTTA GCTAAAGCTA AAAGACCAGA CACTATGCAT ATTTCAACTG GAAATATGTG 24 0 

GCX3ATACTTA GTTGCAATTA TTGCCTGTAT GATTTGGTAC CTTAATAAAG CGCATGTAAG 300 

TATCATCGGT ATAATTATTG GTTTAATGAT TTCATATGTT GTAGTTATCA TACGTCCTTT 360 

ACTAAAGGTG AGCAAATAAA TTAAGAAAGA GGTGAGATTA TGGATCACAA ATCCCCGCTC 4 20 

GTGAGTTGGA ATTTATTCGG TTTTGaTATC GTTTTCAATT TATCAAGTAT ATTGATGATA 480 

CTTGTTACGG cGTTTCTTGT TTTTCTACTT GcTATCATTT GTACGCGTAA TTTGAAAAAA 54 0 

AGACCAACTG GCAAACAAAA TTTCGTTGAA TGGATTTTTG ATTTCGTGAG GGGAATCATT 600 

GAAGGTAACA TGGCTTGGAA AAAAGGTGGT CAATTCCACT TCTTAGCAGT AACGCTGaTT 660 

CTGTACATTT TTATAGCTAA TATGTTAGGT CTTCCGTTTT CTATAGTAAC GAAAGATCAC 720 

ACATTGTGGT GGAAATCZACC GACAGCnGAT GCAACAGTGA CTTTAACGTT GTCTACAACG 780 
ATAATACTGT TAACTCACTT TTATGGAATT AAAATGCGTG GTACGAAACA ATATCTTAAA 84 0 

GGTTATGTAC AGCCGTTTTG GCCATTGGCA ATTATTAATG TTTTTGAAGA GTTCACTTCA 900 
ACATTAACGC TTGGTCTGCG TTTGTACGGT AACATATTTG CAGGTGAGAT ACTATTAACA 960 

TTACTTGCTG GCTTATTCTT TAACGAACCA GCATGGGGTT GGATTATTAG TATCCCAGGA 1020 

TTAATTGTTT GGCAAGCATT TTCAATATTT GTAGGAACAA TCCAAGCATA TATCTTTATT 1080 ' 

ATGCTTTCGA TGGTTTATAT GTCACATAAA GTGGCAGATG AACACTATU^ ATTTCAATAA 1140 

TTATATACAA TCACAGGAGG AAATTAAATT ATGAATTTAA TCGCAGCAGC AATCGCAATT 1200 

GGTTTATCAG CATTAGGAGC AGGTATCGGT AACGGTTTAA TCGTTTCAAG AACAGTTGAA 12 60 

GGTGTAGCAC GTCAACCAGA AGCACGTGGT CAATTAATGG GTATCATGTT CATTGGTGTA 1320 

GGTTTAGTTG AGGCATTACC TATCATCGGT GTAGTAATTG CATTCATGAC ATTTGCTGGA 1380 

TAATTAACAG ATAAAAGAGG TCGGGACAAA GCGCATAGGA CATAATTCAT GATGCATATA 1440 

TAGTAATATC TTTGAACTTT ATTAAATAGT TGAGATATGA ACGCACCATG CCTATCGCAT 1500 

AAATTCAGTA GGTCCTAACC TCGTCGTTTT TTTCTATATA ACACTAGCGA TTATTTTAAT 1560 

GAAAGGAGTG TCATGAACCC GTGACTGAAA CAGCTAACTT ATTCGTTCTT GGTGCAGCTG 1620 

GAGGCGTTGA GTGGGGTACT GTGATTGTAC AGGTCCTAAC TTTCATCGTG TTAcTTGCGT 1680 

TACTTAAAAA GTTCGCATGG GGTCCATTGA AAGATGTAAT GGATAAACGT GAAAGAGATA 174 0 

TTAACAGAGA TATCGATGAC GCAGAACAAG CTAAGTTAAA TGCACAGAAA CTTGAAGAAG 1800 

SO 

AAAATAAACA AAAACTTAAA GAAACACAAG AAGAAGTTCA AAAGATTTTA GAAGATGCTA 1860 

AGGTTCAAGC ACGTCAACAG CAAGAACAAA TTATTCaTGA AGCAAACGTA CGTGCAAACG 1920 
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AGAAATTCAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 
C3GAAATAAGT CATGCCAGAG GTTAAAGTTC CAGAATTAGC AGAATCTAIT ACAGAAGGTA 
CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCTTC 
AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTCA AGAAGCAGGT GTATTATCTC 
AACAACTTGC AAGTGAAGGC GACACTGTAG AAGTTGGACA rGCAATTGCT ATCATCGGCG 
AAGGTAGTGG CAATGCTTCT AAAGAAAATA GTAACGACAA TACTCCACAA CAAAATGAAG 
AAACAAATAA TAAAAAAGAA GAAACAACAA ATAATTCGGT AGATAAAGCT GAAGTAAATC 

,^ AAGCAAATGA TGACAATCAG CAACGTATTA ATGctACGCC TTCTGCGCGT CGATATGCTC 
GTGAAAATGG TGTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATGTC GTTCGTAAAG 
AAGATATTGA TAAGAAACAA CAGGCACCGG CATCAACACA AACAACACAA CAAGCATCTG 

20 CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAArr ar-TftfLi-rrv,^ 

— — .^^wwA oj-ww-u-u-w-V i Vj X 

CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATGT 
TAACAACATT TAACGAAtTG AcATGACAAA TGTTATGGAA TTGCGTAAAC GTAAGAAAGA 
ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 
TTCTGTAGCA GCTTTGAAAA AGTATCCAGA AGTTAATGCA GAAATCGACG GCGACGACAT 
GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATC GATTATTAGT 
ACCATTTGTA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTGcTAA 
TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT 
3^ TACGATTACA AATGGCGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 

TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTGCGA TTCATCAAGA 
TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 
(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2222 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 
ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACAT^n-rTA AACAGTTCAT 
TCAATATTAT TTTTATCTAA TAATnATATT GGGAGGATTA TACCTTTATA CACACCATGC 
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AAGTCACTTG GAAATTGTTG CACCTGTTGT TGAGGGGCGT ACGAGAGCAG CACAAGATGA 1860 

TACACAACGA GCTGGGGCTC CGACGACTGA TCATCATAAA GCAATGCCAA TTATTATACA 1920 

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 1980 

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2040 

TACAGAACCA ATTGATGGAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100 

TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 216 0 

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220 

TCXSTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 2280 

TATTCX3CAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAGCTTGTTA ATGAAGGTGT 2 340 

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAT^GGTVAC T7AGACAAGC 2400 

20 TCATGATAAA ATTAATAAAG CTGATAATVAT GGATAATCCA GATATGGAAA AGCCTGCAGA 2460 

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2520 

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2580 

TCTTGAGAAG CGTCATGAGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGGCACAAGC 264 0 

AGAACAACTT GCATTTGCGA CAATTTTACA AGATGGTACA CCGATTCGCT TAACTGGTCA 2700 

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 2760 

TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820 

TTCTCCGCTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2880 

AAAAAGCTTC AATATTTGGG AAGCACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2940 

TTTTGACAAC TTCTTATTCA GTTCTCGCTC A/^TGGGGA GAAOGTTCAG GATTAACATT 3000 

ATTCTTACCT CATGCATATG AGGGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCX; 3 060 

40 ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120 

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTGAACAAA TGCGACCATT 3180 

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3 240 

ATTTACTTCT GGTGGATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 3 300 
AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAGAAGCAT TAGCTAAAAA 3 360 

TCCAGACGAA TCAGTATTAC TCGTTGCGAT TGAAAGATTG TATCCATTCC CAGAGGAAGA 34 20 

GATTGAAGCA TTACTAGCAC AATTGCCAAA CCTTOPAGPiA GTGTCATGGG TACAAGAAGA 3480 
ACCTAAAAAT CAAGGTGCAT GGTTATATGT CTATCCATAT GTTAAAGTGC TAGTTGCAGA 3 54 0 
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TGGGC5AAAAA AAGACCCAGC AGTATTAGAA GAATCGTTAA ATATTTCTAT TGAAGAAATG 
AATCGTATCA TAAAATTAGT CGAAGAATTA CTTCAATTCA CTAAAGGAGA TGTAAATGAC 
ATTTCTTCTG AAGCACAGAC CGTGCATATT AATGATCAAA TTCX3CTCGCG AATACACTCA 
TTAAAACAAT TGCATCCTGA TTATCAATTT GATACGGATC TGACATCTAA AAATCTAGAA 
ATTAAAATGA AACCTCATCA ATTCGAACAA TTATTrTTAA TCTTTATTGA TAATGCAATC 
AAATATGATG TGAAGAATAA GAAAATTAAA GTTAAGACAA GGTTAAAAAA TAAGCAAAAA 
ATAATTGAAA TTACAGATCA TGGAATTGGT ATTCCAGAGG AAGATCAAGA TTTCATTTrr 
,5 GATCGCTTTT ATCgAGTGGA TAAATCTCGT TCAAGAAGTC AAGGCGGTAA TGGACTCGGA 
TTATCTATTG CTCAAAAAAT CATTCAATTA AACGGAGGAT CGATTAAAAT TAAAAGTCAA 
ATTAACAAAG GAACAACGTT TAAAATCATA TTTTAATCAT GACTCAGACG TCAATCAAAG 
TCATAGGATC AATTTTTTAA GTACACATTA GCTRTnaPTn z.Tn-r^T'y.y.r.-. 

^ • * w * *T. * v-i^-rtrtv^ 

AACAAATAAA CAGTGGTTCT TTATCATTTC TCTTGTACTC CCAAAATTTA CAATAAAATA 
CATCTATAAA CCTAGAAGAA TCAACGCTTT TGTTGATTCT TCTTTTTAGC AGATAAATAG 
GTAAATCTAC TTTAACAAAT AACTAAATAG TGATATTATT ACATTGTAAG CGITrCAACA 
TTTTTGTGGA GGGTGTAAAA TGACTAACGA AAGAAAAGAA GTrrCAGAGG CTCCTGTAAA 
CTTCGGTGCG AATTTAGGTC TAATGTTAGA TCTATATGAT GACTTTTTAC AAGATCCATC 
ATCTGTACCA GAAGATTTAC AAGTCTTATT CAGCACAATT AAGAATGATG ACTCAATTGT 
ACCAGCTTTA AAAAGTACAA GTAGTCAAAA TAGCGACGGC ACAATTAAGC GTGTCATGCG 
3S TTTAATTGAT AATATTCGCC AATACGGGCA TCTTAAAGCC GATATTTATC CTGTAAATCC 

TCCAAAAAGG AAACATGTAC CTAAATTAGA GATTGAAGAC rPTGATTTAG ATCAACAGAC 
TTTGGAAGGT ATATCAGCAG GAATTGTTTC AGATCACTTT GCCGACATTT ATGATAATGC 
TTATGAAGCA ATTTTAAGAA TGGAAAAACX; TTACAAAGGA CCAATTGCAT TTCAGTATAC 
ACATATTAAT AACAATACCG AACGTGGTTG GTTAAAAAGA AGAATTGAAA CGCCATATAA 
AGTAACGTTA AATAATAACG AAAAAAGGGC ACTATTCAAA C7ATTAGCGT ATCTTGAAGG 
GTTTGAAAAA TATCTTCATA AAAACTTCGT TGGTGCAAAG CGTTTTTCAA TTGAAGGGGT 
AGACGCACTT GTACCGATGT TACAACGTAC TATTACGATT GCTGCGAAAG AAGGTATTAA 
^ AAATATACAA ATAGGCATGG CTCACCGTGG ACGTTTAAAC GTrTTAACGC ATCTCTTAGA 

AAAACCGTAC GAAATGATGA TTTCAGAATT TATGCATACA GATCCAATGA AATTCTTACC 
TGAAGATGGT AGCTTGCAGT TAACTGCTGG ATGGACTGGT GATGTGAAAT ATCACCTTGG 
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CATTGATTTA 


TATGATGTGA 


TTaTTkCACA 


AGGTGCCTCT 


ACTGATTCAA 


ATTACGTTCA 


660 




ACAATATCAA 


TTACCAGGTC 


ATTTTGCX3CC 


AATTGCTTCT 


TATCAATTAT 


TAGAAAAAGC 


720 


5 


AGTTGAAACA 


GCACGTGACA AAGGTGTACG 


TCATCATGTA 


GGTAATGTGT 


TATCAAGTGA 


780 




TATTTTCTAT 


AACGCGGATA 


CAACAGCGAG 


TGAACGTTGG 


ATGCGTATGG 


GTATTTTAGG 


640 




TGTAGAAATG 


GAATCaGCTG 


CaTTATACAT 


GaATGCAaTT 


TACGCTGGTG 


TCX5AAGCATT 


900 


10 


AGGTGTGTTC 


ACAGTGAGCG 


ATCATTTAAT 


TCATGAAACG 


TCAACAACAC 


CTGAGGAAAG 


960 




GGAACGTGCA 


TTTaCAGATA 


TGATTGAAAT 


TGCACTGTCA 


TTGGTGTAGA 


TGATTATGAA 


1020 




TGTTGAATAT 


TCTAAAATAA 


AGAAAGCAGT 


ACCTATTTTA 


TTATTCTTAT 


TTGTATTCAG 


1080 


IS 


TTTGGTTATA 


GACAACTCAT 


TTAAATTGAT 


TTCTGTAGCC 


ATTGCTGATG 


ACTTAAACAT 


1140 




ATCTGTAACG 


ACAGTAAGTT 


GGCAAGCGAC 


ATTAGCCGGT 


TTAGTAATTG 


GTATTGGCGC 


1200 


20 


TGTAGTATAC 


GCTTCATTAT 


CTGATGCCAT 


TAGTATACGC 


ACACTATTTA 


TTTATGGCGT 


1260 


GATATTAATC 


ATTATCGGAT 


CAATTATTGG 


TTACATTTTC 


CAACATCAAT 


TCCCATTACT 


1320 




TTTAGTTGGA 


CGTATTATTC 


AAACTGCCXK5 


TTTAGCTGCT 


GCAGAGACAT 


TATATGTGAT 


1380 




ATATGTTGCA 


AAGTATCTTT 


CTAAAGAGGA 


CCAGAAGACT 


TACCTTGGCT 


TAAGTACGAG 


1440 


CAGTTATTCC 


TTGTCATTAG 


TTATCGGTAC 


ATTATCAGGT 


GGATTTATTT 


CTACGTATTT 


1500 




ACACTGGACA 


AATATGTTTT 


TAATTGCATT 


AATCGTAGTA 


TTTACGTTGC 


CATTCCTATT 


1560 


30 


TAAATTATTA 


CCAAAAGAAA 


ATAATACGAA 


TAAAGCTCAT 


TTAGATTTTG 


TTGGCTTAAT 


1620 




TCTAGTGGCA 


ACTATTGCTA 


CAACAGTCAT 


GCTGTTTATT 


ACGAACTTTA 


ATTGGTTATA 


1680 




TATGATTGGT 


GCCTTAATTG 


CGATTATCGT 


TTTTGCGCTA 


TATATTAAAA 


ATGCX3CAACG 


1740 


35 


TCCATTAGTA 


AATAAATCAT 


TTTTCCAAAA 


TAAACGTTAT 


GCTTCATTTT 


TATTTATAGT 


1800 




ATTTGTAATG 


TATCCTATCC 


AATTGGGTTA 


TATTTTTACG 


TTCCCATTCA 


TAATGGAGCA 


1860 




AATTTATCAT 


CTGCAACTAG ACACAACATC ACTGTTATTA GTACCGGGgT 


TaTATAGTAG 


1920 


40 


CAGTCATTGT TGGtOgCACT AAGTGGgTtA AAATCGGgCG 


rAATATCTGG 


AATTCCAAAA 


1980 




CCAAGCGGAT 


TATCACAGCC 


AATTAA 








2006 



(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

SO 
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(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 
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ATGATTTTtA 


aAAATCATTA AGTTAAGGTr 


GATACACATC 


TTGTCATATG 


ATCAAATGGT 


60 


TTCGCCAAAA 


ATCAATAATC AGACAACAAA 


ATGTGCGAAC 


TCGATATTTT 


ACACGACTCT 


120 


CTTTACCAAT 


TCTGCCCCGA ATTACACTTA 


AAACGACTCA 


ACAGCTTAAC 


GTTGGCTTGC 


180 


CACGCmTTAC 


TTGACTGTAA AACTCTCACT 


CTTACCGAAC 


TTGGCCGTAA 


CCTGCCAACC 


240 


AAAGCGAGAA 


CAAAACATAA CATCAAACGA 


ATCGACCGAT 


TGTTAGGTAA 


TCGTCACCTC 


300 


CACAAAGAGC 


GACTCGCTGT ATACCGTTGG 


CATGCTAGCT 


TTATCTGTTC 


GGGCAATACG 


360 


aTGCCCATTG 


TACTTGTTGA CTGGTCTGat 


ATCcGTGAGC 


AAAAACGGCT 


TATGGTnTTG 


420 


CGAGCTTCAG 


T 








431 


(2) INFORMATION FOR SEQ ID NO: 378: 









<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doioble 

(D) TOPOLOGY: linear 
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ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 
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TTTnTATAAC 


GTATTATAAA 


TCGTTAAAAA 


TTTTGGTTGT 


GTTTGCGTCA 


CGTAGACAAC 


60 


CTCCATAAAG 


TTACTTAATC 


ACTCTCATCA 


TACAATAATT 


TTTACTCAAA 


TTGGAAnAAT 


120 


TATAAAAATT 


AAATATAGAT 


AGGCTTTGAA 


AATTAGTTTT 


ATACAAGGTT 


AGTAGCTGTA 


180 


ACTGTAAAAT 


GTTCTTAATA 


TTGTCAAAAT 


GTAATGCTTG 


AAAGCGCTTT TAAAaAATAT 


240 


TATTATATAC 


ATGGTTAGAC 


AAATAGACAA 


ATCACTATAC 


AAATATTGGG 


AGGAATATTT 


300 


TATGAAATCA 


ACACCACACA 


TTAAACCAAT 


GAATGACGTC 


GAAATTGCAG 


AAACGGTTCT 


360 


ATTGCCAGGA 


GATCCGTTAA 


GAGCTAAGTT 


CATTGCAGAA 


ACTTATTTGG 


ATGATGTGGA 


420 


ACAGTTCAAT 


ACAGTGCGAA 


ACATGTTTGG 


TTTTACCGGA 


ACATATAAAG 


GTAAAAAAGT 


460 


TTCTGTCATG 


GGTTCAGGTA 


TGkGTATGCC 


ATCTATTGGC 


ATTTACTCTT 


ATGAATTAAT 


540 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 
TAAACCATTT CAAACTGAGG AACGCnAAGA CGGsACgTTT CCAGATTTAG AAGTATTTAA 
AAATGAATGT GATTTAAGCT ATGACATAAC GTCACTTTAT ACTTTTAAGC AACCTGTATC 
ACCACACCTT GCATTTAAAA TGACAGATCA AATTTTTCTA AATAAGCAGC GTGTATTAGA 
TAAGGTAAAA GTTTTAGATA AGGAATTTGA TTTTATCTTA ATTGAGGGTG CTGGGGGAAT 
TGCCGTACCA ATATATGAAG GTACAGATGA TTTCTACATG ACTAAAGATC TAATCAATGA 
TTGTGCAGAT TGTGTCATCA GTGTGTTGCC ATCAAAATTA GGTGCTATTA GCGATGCCAT 
TGTTCACCAA GATTATGTTA ATCAGAATGT ATCGGCGAGT AATTTTTTAA TAATGAATCG 
CTATACAGAC AGCTATATTG AAAAAGACAA TCAAATGACG ATTGGAAAAT TAACAAATAA 
AACAGTCTAT ACATTTGAAG AACATGCCAC GTATGAAAAT TTCTCAGAAG CATTTTTAAA 
ACAATTAATA GGAGTTAAAA ATGAATTACA CACAACAACT TAAACAAAAA GACTCAGAAT 
ATGTTTGGCA TCCATTTACA CAAATGGGTG TATATAGCAA AGAAGAAGCA ATCATCATTG 
AAAAAGGAAA GGGTAGTTAC CTTTACGATA CGAATGGCaA TAAATATTTA GATGGTTATG 
CATCGTTGTG GGTCAATGTG CATGGTCATA ATAACAAATA CTTnAATAAG GTAATTAAAA 
AGCAACTCAA TAAAATTGCC CATTCTACGC TGCTAGGATC ATCAAATATT CCGTCAATAG 
AACTTGCGGA AAAATTAATC GAAATCACGC CAAGTAATCT AAGAAAAGTA TTTTATTCTG 
ATACAGGCAG TGCGTCTGTT GAAATCGCAA TAAAGATGGC ATATCAGTAT TGGAAAAATA 
TTGATAGAGA AAAATATGCC AAGAAAAACA AGTTTATAAC GCTAAATCAC GGTTATCATG 
GGGATACGAT TGGTGCGGTA AGTGTTGGTG GTATCAAGAC CTTTCATAAA ATATTTAAAG 
ACTTAATAIT TGAGAATATT CAAGTAGAAA GCCCATCTTT CTATCGCAGT AATTACGATA 
CTGAAAATGA AATGATCACA GCTATTTTAA CGAATATAGA GCAAATTCTA ATTGAAAGAA 
ATGATGAAAT CGCAGGGTTT ATATTGGAAC CGTTGATTCA AGGTGCGACA GGCTTGTTTG 
TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 
TAATTTGTGA TGAGGTAGCA GTTGGTTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 
ATGAAGATGT TCAACCGGAT ATTATGTGTT TAGGTAAGGC GATTACTGGT GGCTACTTAC 
CACTTGCAGC TACATTGACA TCTAAAAAAA TATACAATGC ATTTTTAAGT GATTCGCATG 
GTGTGAATAC CTTTTTCCAT GGTCaTACAT ACaCCGGAAA TCAAATcGTT TGTaCGGTTG 
cATTaGaAAA TATaAGaCTT TATGaAAAAC GTaAGTtnAT TGTgCACATa TTGaAACGaC 
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120 

180 

240 

300 

360 

420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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ATGATGGCTT CACACTGTTA GATGCGACAT CATTACAAGA AC:TAarar-T-r T^KTr^T^KnT^r. 


- 

2340 




CGCATACATT GATTACXSCAA GTTTATAGTG CAATGGTTf5r Tr<-TriAT-T-TY- r n R-ri-nr-rwr* 

\— *t/\iur<jx IvjCTAATTTG AAAATCACTT 


2400 


5 


TAATGGAACG ATATCCTGAT GATTACCCTG TTCAAATTGT rAr-rv^Tv-r-^ rv-jv j.r*/>^T.^ 

^^^•'''^ 4. X iTji w\v.TGGTGCA CGAAGCGATG 


2460 




GTGCGGATAA CGTTGTGACA TGCCCATTAT ATGAATTGGA TrATr-aTv-an ftj.'r.-^T.r—^n 

A ^•'vi A A iwiTGATGAA AATGCATTCA 


2520 


10 


ATAATTTGAC GAGTGTATTC GTACCAAAAA TCATAACATr r Ara-rii-n-iv- -rn-rz-Tvov-x^ 

A \— f\ ± K OAC-ATATTTG TATCATGACT 


2580 




TTOATTTTGC AACGGAAGTG ATTGATACTT TAGTTGATnA nm^Tt^T^nnr^n^ ny^n^^^i,^^^^ 


2640 




ATAAAGTGCA AACGCaTGmA AcqCTAAAGC GTTATTTArT Trazi^aaAr-a n-r^^ ,k * rn^,,, 

3 ^ * ■'^*T^VJ^ V7X A AA^l AVjAAvtAAACA TTTGAATTGT 


2700 


IS 


TCGAAGCTAT TGACAATGAA GATGATTGGC ATATGATTTA aniiar-Tftr'r.A i-TT,-n,.r-nou. 

'*^"*^'»A xiLJvj\_ /^aaxvjAI xtjA ACjAaCTAGGA GATATTTTAT 


2760 




TACAAGTGTT ATTGCATACT AGTATTGGTA AAAAAGAAHf rTL.TU'rnr-T.r- *rrKT.,. * ,^ 

*vvjx^x xvjrvjA>4 aaaaaoaa;^ GTATATCGAC ATTAAAGAAG 


2820 




TGATTACAAG TCTTAATGCT AAAATGATTC GTAGACACrr ArannTaTT-r r^Tr^i,^^^^ 

*-wrwi,x%jr^x x\, AA\j>\uAu\„L- AuACATATTT GGTGATGCCA 


2880 


20 


ATGCTGAAAC TATCGATGAC TTAAAAGAAA TTTGGTCTAA nrrraanr-ftT r-r^r^nr.r^r.rr.^ 

* *.*-*™-u-kV3j-u-u-i. X X xvj^jiv_iAA (avjCijAAAGAT GCTGAAGGTA 


2940 




AACAGCCAAG AGTTAAATTT GAAAAAGTAT TTGCAGAnrA TTn-T-raRRT' t^t.t*^^'^^-. 

v*wM-Ln.vj X -rt A X X v> wiLjAvjUA 1 X X 1 X xAAAT TTATATGAGA 


3000 




AGACGAAGGA TAAGTCATTT GATGAGGCCG CGTTAAAnrA mv^r-Tar'a » n * 

A xjravjviv_\_\j v-*jxxaaaow\ CjrGGCTAGAA AAAGGGGAGA 


3060 


25 


GTAATACATG AGATTAGATA AATATTTAAA AGTATrArrn TT&&'ran7Vi-r- i-**-^-T,*«« 

— j-u-k X X X X ASs7 X A X uAv*tjt> TTAATAAAGC GACGTACGCT 


3120 




AGCAAAAGAA GTAAGTGATC AAGGTAGAAT TArAATaaaT nr^'PTi RTv^r-™ 

'w^-a^jx/^vj.rtAi XAL-AA XAAAT GGTAATGTTG CTAAAGCTGG 


3180 




ATCGGATGTT AAAGTTGAAG ATGTGCTGAC GATTCGCrrv nr-Tf-Ra&a»T^ iin/^m^ 

«-k A « X vjv- i X i v-ij^^ XXX ijrtj i V— AAAAAX TAGTAACAGT 


3240 


30 


TAAAGTAACT GCATTAAATG AACATGCATC TAAAGATAap nrv-aafw^o^A tv^o,^^^^ , 

A vj X \- X /ia/\vja X AAk- (j Laj AAGGGTA TGTATG AAAT 


3300 




CATTGAAGAG CGTCGACTTG AAGAAGCGTA AATTnna/^T nn^m^T^r^r-T^T^T^ ^n****^,^,, 
^ AW j-i-r^^*;MT-ri\j^\jx« -rtrtx xLai^AtjijX oAt-AAGCAAT GAAAAATAAA 


3360 


35 


GTAGAACATA TAGAAAATCA GTACACGTCG CAAGAGAAPA Ar2VJiaa&aP& Ror-^^,.**** 
* *'^^~"^^Aw v> \— rirtvjA\jAA\— A AvsAAAAAACA ACGTCAAAAA 


3420 


ATGAAAATGC GTGTTGTTCg TAGGCGTATT ACAGTATTTC: nnnnnn^iiT^ ooowrv^xm^ 

3 A XrtX X X\j CtalivaVAal Ax r aCTX AGCGAXA 


3480 




ATTGTTGTTT TATCaATCTT GCTTGTTGTC CAAAAArATr rr-aa'rr'aTA'n ov^^o^/^i.^*^ 

■'•vjx x\jx^ v-M*\/u\rt.L^X V- ijv_AAiTSATAT TGAtGCACAG 


3540 


40 


GAGCGAAAAG CGAAAGAAGC ACAGTTTCAA AAGCAAGAAA ATrAannaaT tv-i-r^rvrf.** * 

X Jv%>j%_xxn.v_/u\A A X ^jAALaAAA X TGCGTTAAAA 


3600 




GAAAAGTTGA ATAATCTGAA TGACAAAGAT TAPAT^naaa j^aaT-rv^rv-rv-^ m^*.Tv^*r™*-« 

X vjrt X X AL-A X A uAAA AAA 1 iG CG CG TGATGATTAT 


3660 




TACTTAAGCA ACAAAGGTGA AGTGATTTTT AGGTTGCCAG AAttAPAAAra T^rmTr^a/^r. 

* * *■ * «-fcw>j X X v_-/A.VJ rtrlVJrt V-AAAviA X X 1 LTx AGC 


3720 


'to 


TCAAAATCTT CGAAAAAATA AATrPAAAT^ r-a'PT'r^R a a kt* rnT^rT«r^r^«.^r*.i. ^^^^^ 

wwmu-Lrti-u^xrt /\AxuL.AAAil (jATTCAAAAT TATCCGAGTA tagacattgt 


3780 




GAAAAAA 


3787 




(2) INFORMATION FOR SEQ ID NO: 376: 




50 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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CAACAAGTGT ATCCAATTGT AGCATTAGGA ATGATATTAT CGATGAATGC CATTCCTAGT 54 0 

GCAATTACAC AAAATATAGG GAAGTATCAT AGTGACGAAG CATATGCAAA AGCAGTCGCT 600 
TATATACAAT TAGTTGGTAT ATTATTATTT ATTGCTATTT TTGTGTTTGC GAACAATATT 660 
GCACATATGA TGGGTGATGG CCATTTAACA CCAATGATTC AAGCTGCAAG TTTAAGCTTT 720 
ATATTTATAG GTATGCTTGG CGTGTTAAGA GGTTATTATC AATCTGCAAA TAATATGACA 780 
GTTCCGGCTA TTTCCCAGGT TATAGAACAA GTTATACGAG TAGGTATTAT CATTGTTACT 84 0 

ATTGTTATTT TTGTAGACAG AGGTTGGACG ATATATGAAG CGGGAACAAT TGCTATTTTA 900 
GCATCAACGA TAGGTTTTTT AGGTTCTTCA ATTTATTTAG TAGCGCACCG ACCTTTTAAG 960 

TTTAAAATGG TAAATAACAC TGCAAAGATC GTTTGGAAAC AGTTCGCACT TTCGGTTTTG 1020 

ATTTTCGCTA TCAGTCAATT AATCGTAATT TTATGGCAAG TGATTGATAG TGTTACTATT 1080 

ATTAAGTCAC TTCAAGCGAT ACGOGTGCCA TTCGATGTTG CCATAACTGA AAAAGGAGTC 1140 

TATGACCGTG GTGCATCATT TATTCAGATG GGATTGATTG TAACTACAAC ATTTAGTTTT 12 00 

GCGCTCATTC CTCTGTTAAG TGACGCAATC AAAATGAATA ATCAGGTACT TATGAATCGT 1260 

TATGP^TG CGTCATTAAA GATTACGATT TTAATAAGTA CAGCAGCGGG AATAGGATTA 1320 

ATTAATTTAT TGCCTTTAAT GAACGGTGTG TTTTTTAAGA CGAATGATTT AACCTTAACG 1380 

TTAAGTGTTT ATATGATTAC GGTCATTTGT GTATCGTTAA TTATGATGGA TATGGCATTA 1440 

TTACAAGCGC AACATGCTGT GAGACCTATT TTTGTTGGTA TGACGGCAGG ATTGGTTATT 1500 

AAATTTATAC TTAATATCAT TTTGATTCGT TTAAGTGGCA TTATTGGTGC GAGCATTAGT 1560 

ACTGTTGTAT CATTAATTAT ATTCGGTACG ATTATCCATA TTGCTGTCAC GAGAAAATAC 1620 

CACTTATATG CGATGAGACG ATTTTTTATC AATGTTGTTT TAGGTATGGT ATTTATGTCG 1680 

ATTGTTGTTC AATGCGTGTT AAACATAGTG ACAACACACG GTAGAATCAC TGGACTCATT 174 0 

GAATTATTAT GTGCAGCAGT ATTAGGTATC ATTGCATTGT TTTTCTATAT TTTTAGATTT 1800 

AATGTTTTGA CATATAAAGA GTTAACTTAT TTACCATTTG GTTCAAAGTT GTATCAAATT 1860 

AAGAAAGGAA GACGTTGATG GCACATACCA TTACGATTGT TGGCTTAGGA AACTATGGCA 1920 

TTGATGATTT GCCGCTAGGG ATATATAAAT TTTTAAAGAC ACAAGATAAA GTTTATGCAA 1980 

GAACGTTAGA TCATCCAGTT ATAGAATCAT TGCAAGATGA ATTAACATTT CAGAGTTTTG 204 0 

ACCATGTTTA l-GAAGCACAT AACCAATTTG AAGATGTCTA TATTGATATT GTGGCGCAAT 2100 

TGGTTGAAGC TGCTAATGAA AAAGATATTG TCTATGCGGT TCCGGGTCAT CCTAGAGTTG 2160 

CTGAGACAAC TACAGTGAAA TTACTGGCTT TAGCAAAGGA CAATACTGAT ATAGATGTGA 2 220 
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TGCATTAATT TTAGGTTTAA TTGGAGGOTT CCTTTTAGCT AGAAAATATA TGAT^^ ' i08< 

CTTGAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATCATCATGA TGCAAATGGG 

TCAAAAACCT TCTCAGAAGA AAATTAATCA AATGAltSACG ATGATX5AATA AAAATATGGA ^^OC 

TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA ^26C 

TGGAAATGGC CTCTTTTTAT AATCAAATTA ATAAGAATAA ATATGTrTAT TAAAATTAAA 1320 

GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATTTTCTATG ATTTACAGAG 1380 

AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTTTCAATC ATTCTAGCTr 1440 

TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT CCGGTAAAkG isoo 

AGAAAAAAAT AGTTTTGCCa CCgTTTOA tGGCgACCGG TCCATTCATG TACGTCGTTC 1560 

CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAGCCTT TATAATTGGT TTGCTTTTTT 1620 

CtACAGTTCT AAITTGGACT TCTCGATTTC AAGTCAAAGG TACAGAAATT TATATGAAAC 1680 

GATCTAAAGC ATTTCCAGTT ATTTTGATTT CATTACTTAT CATTCGTACT GTGATGAAAA 174 0 

TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATCTTCTTT TTATTAGCAT 18OO 

TCTGTATGAT TGTTCCrrGG AGAGCAGCAA TCCTATATAA ATACAAAAAA CTAAAGAAAA I8GO 

CATTAATCAA TTAATTACTT TTaAAACCAC TTGTGATCGA CTTCTAAATC AGTCAATGAG 192O 
TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 787 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : dov±>le 

(D) TOPOLOGY: linear 



25 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



45 



50 



ACATTTGATC 


AAnTTATCGA 


GATTAAAGAT 


GAATTCAnTT 


GATCGTnTCA 


ATGATTATCC 


60 


TGTTGAAGTA 


GCACGTTTGC 


TTGATATAGT 


GGAmATaAAA 


GTACACGCAT 


TACATTCAGG 


120 


TATCcACGTT 


GATTAAAGAT 


AAAGGGAAAA 


TAATTGATAT 


TCATTTATCT 


GTAAAAGCCA 


180 


CTGAAAATAT 


TGATGGCGAA 


GTGCTGTTCA 


AAGCAACACA 


ACCTTTAGGT 


AGAACAATGA 


240 


AGGTTGGTGT 




GCAATGrCAA 


TTACTTTAAC 


GAAACAAAAT 


CAATGGCTTG 


300 


ATAGTTTGAA 


GTTTTTAGTT 


AAGTGCATTG 


AAGAAAGTAT 


GAGAATCAGT 


GATGAAGCAT 


360 


AAAGAAGCAT 


TTAATGGCGT 


TGTCX3TGTTA 


ACTGCTGCAT 


TAATTGTCAT 


TAAAATTCTG 


420 
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AAGTTAAACA ATATGGTGAA AACTCATTTG TrGGACATAG 
TCTTATATGA AAATGTGATG TCAAAAATTT CATCTAGAGA 
CTAGCTCAGT AAGTGGTAGT ATCCCATCAA TCAATGTTTC 
CTACAAATCG TAGTACACAC GGTGGTAGTG ACACATCAGC 
AATCAAATAA CAATACTAGA TCTCAACAAT CTAGAAACAG 
TCAACTAATC CACTCAACAT AAAATCCTCA GTTATACCAT 
TTTTnTTAGG TTCTTCATCT TTTATGG 
(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : 1 inear 



CCAACAAGAA 
TGGCGAAGAC 
TGGTAGTCAA 
AAACAGCAGT 
CGGTGGATTA 
ATTTATGGTG 



TATCCACAGT 
TTTAAACGTC 
GATAACAACA 
GGTACTGCAC 
ACAGGTATAT 
TAGCCGAGGA 



900 
960 
1020 
1080 
1140 
1200 
1227 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 
CCATATGGtG CAACATTCTT CGTATTTAGT GATTATTTAA AACCAGCGTT ACGTTTATCA 
TCAATTATGG GATTAAATgC aACGTTCATC TTCACACATG AyTCaATTGC AGTAGGTGAA 
GATGGTCCTA CTCATCAACC AATTGAGCAA TTAGCTGGAT TAAGAGCCAT TCCAAATATG 
AATGTTATCC GTCCTGCTGA TGGTAATGAA ACAAGAGTAG CATGGGAAGT TGCCTTAGAA 
TCTGAATCTA CACCTACTTC ATTAGTATTG ACACGTCAAA ACTTACCGGT ATTAGATGTA 
CCAGAAGATG TAGTTGAAGA AGGCGTTCGA AAAGGTGCCT ATACAGTTTA TGGCTCTGAA 
GAGACACCAG AATTCCTATT ATTAGCTTCA GGTTCAGAAG TTAGTCTTGC AGTTGAAGCT 
GCTAAAGATC TTGAAAAACA AGGTAAATCA GTACGTGTTG TTTCAATGCC TAACTGGAAT 
GCATTTGAAC AACAATCTGA AGAATATAAA GAATCAGTTA TTCCATCAAG CGTAACAAAA 
CGTGTTGCGA TTGAAATGGC TTCACCGCTT GGATGGCATA AATATGTAGG TACTGCAGGT 
AAAGTTATTG CTATTGACGG CTTTGGCGCA AGTGCACCTG GCGATTTAGT AGTTGAAAAA 
TATGGATTTA CAAAAGAAAA TATCTTAAAC CAAGTTATGA GCTTATAAGA ATAATTTATA 
AAGCGAGTAT GTTTAGAAGT CTAGGATGCA TAATCTTAGG CTTCTTTTAA AGTGTTGAAA 
TTTAGAGTAT AGCACTTAAA CTACATCATA AGTGATAAGT TATGAAAGTA TACTATTTCA 
GATTAATCTT TAAAAGCTCT GTTATAACAG CATGATTTTT GATATTATTT TTAGTATCGA 
TATTAAAATA CTTGAATAAA CTAGTTCTTG AAATAATGTG ATGAATTTAG TAAAATTCAG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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AGTCATATAT GCGTTTATCT AGTCTATCTA TAAATTGCTG CATATCTGCA ACTTGTTGAA 
TATCCATTTG ATTAGTGGAt TgtGCGCTTG CTGTTGCAAT TGTGGTAGCT TTTCATTTTC 
TAATTGCAAC TTTTTTTGcT GTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG 
TTTGTTTTTA TCATATAGCG TATCTAATAA TTCAATATCT CTTGTTAAAT GTGTTTGATG 
TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 

10 

GATTTCATTG ATAGACGACT TTGCTCTGCT AAAAATTCTT TTTAACATAG ATGGTTTATC 
AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGATCTG ACAAAGTATC 
TCCAATAGGA CCAACATCTT TACTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 
CATTTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 
TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 
20 (2) INFORMATION FOR SEQ ID NO : 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doub 1 e 

(D) TOPOLOGY: linear 



30 



35 



SO 



60 
120 
180 
240 
300 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 
ATGTTGATAA TGGTAGCTTC TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG 
aTAGTAAAAC TGGTGGTTTA GTTGCTATAT CTGGTGGACG TGATTTCAAA GACGTCGTTA 
ACAGAAACCA AGCAACAGAT CCTCACCCTA CTGGTTCATC TTTAAAACCT TTCTTAGCGT 
ATGGACCTGC CATTGAAAAT ATGAAATGGG CAACAAACCA TGCGATTCAA GATGAATCTT 
CATATCAAGT TGATGGTTCT ACATTTAGAA ACTATGATAC GAAGAGTCAC GGTACTGTAT 
CTATTTATGA TGCTTTACGA CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAATCAG 360 
TTAAGCAAAA TGCTGGTAAT GATGCACCTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 420 
ACGAAGGCGA TATTGGTCCA TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 4 80 

45 CACAATTAGC ATCAGCATTT GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 540 

CAATTCAAAA AGTAGTTACT CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 600 
AAGCGATGAG TGATTACACT GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 
CATATGGTTC TGCATATGGC CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 
CTGGTACTTA CGGTGCTGAA ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 



660 
720 
780 
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CTGTATTTGA 


ATTAATGAAT 


GAAAAGATAA AACATCAAAT 


GCCAGGTGAA 


AGTGGGCTCA 


1500 




TTGCTCTTGA 


TTGGCATAAT 


GGAAATCGAA GTGTATTAAG 


TGATAGCAAT 


TTAACAGGTT 


1560 


5 


GTATCTTTGG 


ATTAACTTTA 


CAAACTAAGC ATGAGGaTAT 


TTATAGAGCm 


TATTTaGatAG 


1620 




CTACaGCATT 


TGGTACtAAG 


ATGATTATGC mACAGTATCA 


AGATTGGCAT 


ATGGaAGTAG 


1680 




aAAAGGtATT 


TGCaTGTGGc 


gGTAtACCtA AAAAGAATGC TGTTATGATG 


GATATCTATG 


1740 


10 


CGAATGTACT 


GAATAAAAAA 


CTAATTGTTA TGGATAGTGA 


GTATGCACCA 


GCAATAGGCG 


1800 




CAGCAATATT 


AGGTGCAGTC 


AGT 






1823 


IS 
20 


(2) INFORMATION FOR SEQ ID NO: 372: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 






25 


ACGATCATCT 


GCATCAGCGA 


ATTCCGATGC AaTTcATATG 


tCCtAATAAC 


AAAAGAATAG 


60 




GTTTTaAAAG 


ATATGCATAC 


CGTAATGaTA GATATAGTTT 


TAAACGTGAC 


TTCAAGCTAT 


120 




ATGaATGTGA 


TGACTGTTCA 


TCATGTTCTT TGAGACATCA 


ATGCATGAAG 


CCAAATTCGA 


180 


30 


AATCCAATAA 


GAAAATTATG 


AAGAATTATA ATTGGGAATA 


CTTTAAAGCC 


CAAATTAATC 


240 




AAAAGCTTTC 


TGAACCAGAA 


ACGAAAAAAT CTATAGTCAA 


AGAAAAATTG 


ATGTAGAGCC 


300 




TGTTTTTGGA 


TTTATGAAGG 


CTATTTTGGG TtTCACTCGA 


ATGTCAGTTC 


GGGAATAAAT 


360 


35 


AAAGTTAAAC 


GAGAGCTAGG TTTTGTATTA ATGGCACTTA ATATAAGGAA AATAgcaGCT 


420 




CAACGAGCTG 


TACATTATAA 


AATACATATC AAAAAAGCTG 


ATTTCTATCA 


AATAAATAAT 


480 


40 


AGAAATCAGC 


•ri-rriTACAT 


TGCCTAAGAA CTTTAAGGAA 


CTTAATGTCC 


CAAGCTCTTT 


540 


TTTGTTATAT 


CTAATTCGTA 


ATTTATGATT GTTTATTCGG 


TCCTTTGATG 


TTCACTAAAT 


600 




GTGACTTTAA 


ATCTTGTTCT 


AATTGTTGTA ATTCTTTTTC 


AGCTAATTGT 


CGTTCTTCGC 


660 


45 


GACCGTGTTG 


TTGAATAATT 


AATGTTTCTT CAATTGTCTC 


AATAATGTTA 


CGCTGTGTAC 


720 




GTTTCAATGT 


ATCAAGATCA 


ACAATGCCAC GCTCATTTTC 


TGTTGCAGTT 


TCAATCGCAT 


780 




TTTGTTTCAA 


CAl-TTCAGCA 


TTTGCTGTTA ATAAATCATT 


AGTTGTATCA 


GTGACAGCTC 


840 


SO 


GTTGTGCAGC 


AACTGCATTA 


CGCTGTCTCA TTAATGTAAG 


CGCAATGGCC 


ATTTGATTTT 


900 




TCCATAGTGG 


AATACTTGTC 


AAAATTGAAC TTTGTATCTT 


CTCGGCAAGT 


GCTTGATTAA 


960 
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(2) INFORMATION FOR SEQ ID NO: 3 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 
ATAGATGAAG GTGCAAATAT TGAAaTAGGT TATTTACCTG GACGCTTGAA ATGGTTAGTT 
GCTGATTTAT TAACTAAACA AGGATTAAAA GTAGTTAACG ACGATATGAC AGGAAGAACG 
TTAAAAGATC GTAAATTATT AACAGGTGAC AGTCCTTTAG CTTCAAATGA GTTAGGAAAA 
TTAGCAGTTA ATGAAATGTT AAATGCAATA CT^AAATAAAT AATTAAATAT TAATTAGAGG 
20 AGCCTCATAT GTAAATGTAT GAGGGCTCTT TTTTTTGGCA AAATTTAAGT GATACTTGTA 

PiPJKTP.QPJ^QO TATTATGAGT ATGATTTT^G AAAACGCTTG CPJkPAQTPJ^I: AACCGCAACT 
AGCGATATGG AGGAAACATG ATGTCTTATA GCATTGGAAT TGATTATGGA ACTGCTTCAG 
GCCGTGTGTT TTTAATTAAT ACAACTAACG GTCAAGTAGT ATCAAAATTT GTGAAACCAT 
ATACACATGG TGTCATTGAG AGTGAATTAA ATGGTTTGAA AATACCACAT ACATATGCAC 
TTCAAAATAG TAATGATTAT tTAGAAATTA TGGAAGAAGG AATATCATAT ATAGTACGTG 
AATCAAAAAT AGATCCAGAC AATATAGTAG GTATTGGTAT AGACTTTACT TCATCTACTA 
TTATTTTTAC TGACGAAAAC CTTAACCCGG TACATAACTT AAAACAATTT AAAAACAATC 
CACATGCGTA TGTGAAACTT TGGAAACATC ATGGTGCATA TAAAGAGGCA GAGAAATTAT 
ATCAAACTGC TATTGAAAAT AATAATAAGT GGTTAGGCCA TTATGGATAT AATGTTAGTA 
GTGAATGGAT GATTCCCAAA ATAATGGAGG TCATGAATCG AGCACCAGAA ATTATGGAAA 



TACGCTCGAA TTGTGGATTA GGTTTCAAAG CATTTTGGGA AGAAGAAACA GGGTITCATT 
ATGATTTATT TGATAAAATA GACCCCAAAT TATCAAAAGT AATTCAAGAT AAAGTATCTG 
45 CACCGGTTGT TAATATTGGT GAAGCAGTAG GGAAACTGGA TGATAAAATG GCACAGAAAT 

TAGGATTATC AAAAGAAACT ATGGTAAGTC CTTTTATTAT TGATGCCCAT GCTAGTTTAT 
TAGGTATTGG GTCTGAAAAA GATAAAGAAA TGACTATGGT GATGGGAACA AGCACATGCC 
ATCTTATGTT AAATGAAAAG CAACATCAAG TGCCAGGTAT ATCAGGTTCT GTAAAAGGAG 



60 



1020 
1080 
1140 
1200 
1260 
1320 



CAATTATTCC AGAATTATTT GCTTATGAAG CGGGGCAATC AGCAGTAGGT GATTTGTTTG 13 80 
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ACACATATAT TTGCAATAAG ATAATTAAAG TAGGATATTA TTTTTAGTTT TCTGATAGGA 1980 

ATGATGATAG TCTATAGGTT GAATCTTTAC TTTTTTTTAA AGCTAAATTT ACATCAACTT 2040 

^ AACAATGGTT GGTTATACTG AAGATGAAAA TATTTAGTAT AACTTAGTGG AGGCX5ATAAA 2100 

GGTGCAATTG AGTCATTCCG TTAAAGTTGC AATTTCTATC TATTTAGCAC TTATCTTTAT 2160 

AACGTTCACT TCTTATTTAG TCATTATTTT ATATACGAGT ATGACTGGAC ATGATGTATC 222 0 

10 

ACATTTCGTG TTAGATAGTC AGCATTCTCA TCATGGATCT CTTACGCAAA AACATTTGAG 2280 

TCTTCCTGAA ATCTCATTTA AATAGTTTAT CCTCTGTGTT TCAACATTCA TTTCCCATAT 234 0 

CGATTCATTT ATCTATCATC TAGACCACTA CATCTTAGAT GATTTTTTTA TTTTCTCATT 24 00 

15 

TCACTCTTTC TTAAAAGTCG ATATAATGAA TTAAATCATT ATCATACACC GACATATTTT 2460 

ATGTTGTTGG TGTTAAGTTT eiAAGGGGTGA GATACTTGGC GAATaATCaT TCAGCTTT 2518 
(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 <D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 



30 



35 



40 
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ATACTACTGG 


ACCGTCnACC 


AATTGACAAA 


ACCTGAAAAC 


TGGATTTTTA 


TTGCAGGATT 


60 


ACATCGAACC 


TGGAAGAACA 


ATCCCCAGGA 


GTTGATGATA 


TTCGAGATAT 


GGCATACAAT 


120 


CAAGGTAGTT 


TAGATAAGAC 


AATTTATGAA 


ATTTCTAAAC 


GCACAGTACT 


ATTTTTAATA 


180 


CAGAAAGATA 


TTACGGTATA 


TAATAAGACG 


ATTGACTGTT 


TAAATTATTA 


TAACTATAGT 


240 


GACGAAAGAA 


TAAAGGATGA 


TTAAATGAAT 


TCACAAGAAT 


TATTAGCAAT 


TGCTGTGGAT 


300 


GCAATTGACA 


ATACCCCAGG 


CGAAGATACG 


ATTTCTTTAG 


AAATGAAAGG 


TATCAGCGAT 


360 


ATGACAGATT 


ATTTTGTTGT 


AACGCACGGA 


AATAATGAAC 


GACAAGTTCA 


AGCGATTGCT 


420 


AGAGCGGTGA 


AAGAAGTAGC 


CAATGAACAA 


AATATAGAAG 


TAAAACGTAT 


GGAAGGATAC 


460 


AATGAAGCGC 


GTTGGATATT 


AATTGACTTA 


GCTGATGTTG 


TGGTACATGT 


TTTCCATAAA 


540 


GACGAAAGAA 


ATTATTATAA 


TATTGAAAAG 


TTATATCAAG 


ATGCACCATT 


AGAATCATAT 


600 


AGTCAGGTTG 


CGTATTAATT 


ATGTCGCAAT 


ATGCAGAAAT 


GAGCCTAGTG 


TACGATCAAT 


660 


TGACTCAAGA 


TCAACCATAT 


GAAAAATGGT 


TTGAAATTGT 


AAAAAATCAC 


TGCAAAGATG 


720 


AATCAAATAT 


TTTAGATATT 


GGATGCGGTA 


CTGGTAGTTa 


ACAGTTCAAT 


TAGAAGCTTT 


780 
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GATAATTTCn 


AATGTAGCGA 


CCATCATTAC 


GAGATGATTT 


GATATAAGCA 


CAGTTTGGAT 


180 




GTTGACCAAT 


ACTATCGCCT 


TCTTCTTCGA 


TGATATCTAT 


TTTAATACCA 


TCATCAGCTG 


240 


5 


CAATTTCTAA 


TGAAGATTTA 


ATTCGGTTAT 


CAAATGTTGA 


ATATCCCATT 


GCTCCACCCA 


300 




CAATAGCGAC 


ATCTGTACCA 


TGTCCTTGGT 


GTGTTTGAGC 


AAATGATTCA 


TAATAATGTA 


360 


10 


TTTCAATATT 


TTTAGGAATA 


TCTCCCAATA 


TTGCGCGTGC 


TGAATTCCCA 


ATCTTTACTG 


420 


CACCAGCCGT 


ATGAGAACTT 


GAAGGGCCCA 


TCATAACTGG 


TCCGATAATA 


TCGAAAGCAC 


480 




TTTGATAATC 


ATAGCTCTTT 


GCCATAATTA 


AACACTCTCC 


TTAATATGAT 


TCTTTTTGCT 


540 


1$ 


CGGCATTTTA 


AAGTTGATAT 


TCATTAAATT 


AAACTTATTA ATTAGTGTTC 


CAATAATATA 


600 




GGCTAAGACA 


ATGCTGACAA 


TAATCACTGT 


TGCGATTGTT 


ACAATTGATG 


TCACTGCATT 


660 




ATTAAAGCCA 


AACAATACGA 


TGGCGCCTGC 


AATTGGTGTT 


GCCATACCTT 


TGACACCTAT 


720 


20 


TACTAGTCCG 


CTAAATGTCA 


CGATACATGC 


GTTGACGACC 


CCAATCAGTG 


CATTTGTACC 


780 




ATATAGTTGT 


ACTGGATATT 


GCGCTATTAA 


ATCAATTTGC 


GTCAATGGCT 


CAATACAAAC 


640 




TGCAAATGCT 


TTTGACGGTC 


CACCAATGTT 


TAATTTTCGG 


AATAAAATAA 


GGTTAACAAA 


900 


2S 


TGAGCTACCT 


GTACATGTTA 


GTGCTCCAAT 


AGCCATAGGA 


ACACCTGTCA 


GTCCTAATAA 


960 




ACTTGTTAAT 


ACCATTGAAC 


TTAGCGGTGT 


CATACCTGTA ACAGGAATCA 


CTAGTCCTAA 


1020 




AATGACCGCT 


AATGCATATG 


GATTGTTATC 


ACCTACCGCA 


GTGACAGCAC 


TACCTATTTG 


1080' 


30 


TTTTAATGTT 


GCTAGCACAC 


CAGGTGTAAT 


GATTGATGCA 


AGTCCGAAAG 


CAATTGCTGG 


114 0 




TGCAAATAAG 


ATCACCACAA 


TTAAGTCCAA 


GCCTTCTGGA 


ACTTTCTTTT 


CAATCCATTT 


1200' 


35 


AATTAAAAAA 


GcTACGCCAT 


AAGCTGCGAT 


GAATGCTGGT 


AATAATTTAA 


AGTCATGTAA 


1260' 


TACTAAACCA 


ACAATGACCG 


CAAATACTGG 


TGCAACGCCT 


AAGTTTAAGC 


ACGTTAGAAT 


1320 




' ACCTACTGcG 


ATACCGCTTA 


AACTTCCTGC 


TAAATCCCCA 


ATATCTTGTA 


GAAATTTAAT 


1380 


40 


ATCT^AATACG 


CCACCAATAG 


CATAACTTAA 


GAATGCTTGT 


GGTAGAAATG 


TCGCACAAGC 


1440 




TGCACCGGAT 


AATGCTTGTA 


GTCCTTGTTT 


ACCGTACGGT 


GCATACTTTA 


AAAATAGCGT 


1500 




CATGATCACT 


AAAACCAAGA 


CTAATGTGCC 


TACACCTAAC 


AGAATATCCA 


TTTCCCAAAA 


1560 


45 


ACCTCTCTCT 


ATGTTTATTT 


TATTTTCAGA 


CCATAAACAT 


CGTACACCCT 


TAAGAAAACG 


1620 




TTTTCAACTT 


TTATCTGTTA 


TCAAATCaAA 


TATTTAAGTG 


AAATATTTCA 


TATTTGTGAA 


1680 




AGATTTTTAA 


AATGGATTGT 


TTCAAAAACT 


ACTTATGTTG 


TCGTTAATAT 


TTACTAATTA 


1740 


SO 


ACTTTTTACT 


CTATATTTCA 


AACAGTTGTG 


TGACAGTTTT 


TTGATAACTT 


TTTTACATCT 


1800 




GAAAGTAAGT 


AATATTTCTA 


AAAACTTTTA 


ATATTTATAC 


ACTTTATCTT 


TCGAGCTATT 


1860 
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TO 



75 



30 



35 



40 



SO 



TAGTCCACCT AGTTGTTTAG CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 1500 

TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTt TCTAAATCTT GAAACTGACT 1560 

TTTTAGCTGT CGAATCATTT CTAAAATATA CGTTTTATTA CTCAAGGTGA TATCTTTCCA 1620 

CATTTGTGCA TTACTACTAG CTATACGAGT GATATCACGA AAACCACCAG CTGCAAGTTT 1680 

ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 174 0 

GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTCG TGTTCTTCAG CAGTAGTTAC 1800 

AATAAATTTA GCAAGAGTAG GTGATAACAG TTCTTTTAAC GTGTTTG 1847 
(2) INFORMATION FOR SEQ ID NO: 368: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS : double 
20 (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 60 

AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 12 0 

TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 180 

CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 24 0 

TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 300 

TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 360 

TCACTATCTA TTAAATAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 420 

GTGTACCATT TTCCATCCTC TTCAATGACA ATGTTGCCAA TATCAAATTC AAGGACCTTG 4 80 

CCGTCCTCTG AAGT 4 94 
(2) INFORMATION FOR SEQ ID NO: 369: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2518 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 9: 
GTCCATATTT CCCGTCCAAC CAACTAAAAA TGGGTAAATC CCTGGATTTa AATCTACTTC 60 



55 
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10 



25 



30 



35 



50 



(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

AAGATTAGGC ATCAAGACAG GGTCGCGATT GTTTGAAATC CCACATAGnA ATGATATTTa 60 

CaTTATCAAT CCaAGTATGC GTAAATATCT TAATGTTTCA GTTGCTATTT CTAAGATTGC 120 

ATTGCGTTAT ATTCCACCTG AAGATTTACA CCAATATAGT ATTGACGAAT TTITTATCGA 180 

TGTTACTGAT AGCTATCATA GATTTAGTTC TACAGTACAT GCATTTTGCG AAAGACTTAA 240 

20 ACGTGAAATT TATGAAGAAA CAGGCATTTA TTGTACTGTG GGCATTGGTT CTAATATCrr 300 

ATTAAGTAAA ATTGCTATGG ATGTTGAAGC GAacATAGTc AAAATGGTAT AGCTGAATGG 36 0 

CGATATCaAG ATGTACCAAC GAAATTATGG CCAATTCmGC CCtTGCGAGA TTTTTGGGGT 420 

ATTAATCGTC GAACAGAAGC CAAATTGAAT AAAAGAGGAA TTTTTACTAT AGGAGATTTA 480 

GCGAAATATC CATATAAATT TTTAAAAAAA GAGTTCGGTA TTTTAGGTGT TCATATGCAT 54 0 

CTACATGCGA ATGGGATAGA TCAGAGTAAA GTACGTGAAA AGCACAAGAT CAGCAATCCA 600 

TCGATATGCA AAAGTCAAAT ATTAATGAGA GATTATCATT TTGATGAAGC AAAAGTAGTA 660 

ATGCAAGAGT TAATTGAAGA TGTTGCTAGC AGAGTTCGAG CAAGAAAAAA AGTGGCAAGA 720 

ACGATACATT TTGCCTTTGG CTATAGTGAT GAAGGCGGTG TACATAAGCA ATATACTTTC 780 

AAAGATCCAA CAAACTTAGA AAAAGATATT TATAAAGTAG TAAOXSCATTT CGCAGATAAA 840 

TTATGTAATA AACAAGCACT ATATCGTACG CTAAGTATAT CTTTCAGTCA ATTTATTAAT 900 

^ GAGGATGAGC GACgTTAAGT CTGTTTGAAG ATGAATACCA ACGCAAACGT GACGAATGTC 960 
TAGCTAAAAC GATAGACCAA TTACATTTGA AATACGGCAA AGGTATTGTG TCCAAAGCAG 
TATCGTTTAC AGAAGCAGGT ACAAAACACG GCAGATTAGG TTTAATCGCT GGACATAAAA 
45 TGTAATGACT ATACGGTTTA AGTAATATAT AACTGTGATT CGTATAAAAT AAGTCTCTAA 
AGATAAATAT TTCATATATC ACAATAGATT TTCACAATAA TATCTAAGAA TACATGGAAT 
TTATCAAAAG AGACTTAATA ATTATTGGAT ATAACAATCA AAATCACTCA ATGCTTGCAT 
ACCGCGTTCT CGGTCAGTAG GGTTTTTGAA ACTAATTTTT AAAGCACCGT ATATATCTTC 
GCGTACTTCT AAGATTCTTA AGTTGCTTAT AGATATGTTA TGTAAACTCA GGATATAAGT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
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(Xi) SEQUEHCE DESCRIPTION: SEQ ID NO: 366: 





aaaAGAGACG 


CATTaAAACA 


AGCTATACAA 


ATTATCGATA 


AATTAACATG 


GGGTGTTTAG 


60 


5 


TGGTGGTTAA 


AGAAATTTTG 


AGACTATTAT 


TCTTACTAGC 


GATGTATGAG 


CTAGGTAAGT 


120 




ATGTAACTGA 


GCAAGTATAT 


ATTATGATGA 


CGGCTAATGA 


TGATGTAGAG 


GCGCCAAGTG 


180 


10 


ACTTTGAAAA 


AATCAGAGCT 


GAAGTTTCAT 


GGTAATAGCT 


ATTATCATTT 


TTGAATTAAT 


240 


TATATTAATG 


TGTGTAGCAA 


TAGCACTGGA 


GGTGTTGTAA 


ATATGTGGAT 


TGTCATTTCA 


300 




ATTGTTTTAT 


CTATATnTTT 


ATTGATCTTG 


TTAAGTAGCA 


TTTCTCATAA 


GATGAAAACC 


360 


15 


ATAGAAGCAT 


TGGAGTATAT 


GAATGCTTAT 


CTTTTCAAGC 


AGTTAGTAAA 


AAATAATGGT 


420 


GTTGAAGGTT 


TAGAAGATTA 


TGAAAATGAA 


GTTGAACGAA 


TTAGAAAAAG 


ATTCAAAAGC 


480 




TAAAGAGAGG 


CGTTGGCTTC 


TCTGCTCTAT 


CyAAAATAAT 


GAAAGGAGCC 


saACATGTTA 


540 


20 


GaCmAAGtCA 


CTCAAATAGA 


AACAATTAAA 


TATGATCGTG 


ATGTCTCATA 


TTCTTATGCT 


600 




GCTAGTCGTT 


TATCTACACA 


TTGGACTAAT 


CACAATATGG 


CTTGGTCTGA 


CTTTATGCAG 


660 




AAGCTAGCAC 


AAACAGTTAG 


AACTAAAGAA 


GATTTAACTG 


AGTACAATAA 


AATGTCTAAG 


720 


25 


TCTGAACAAG 


CCGATATAAA 


AGATGTTGGC 


GGATTTGTCG 


GTGGATATTT 


AAAAGAAGGC 


780 




AAACGGCGTG 


CTGGTCAAGT 


CATGAATCGT 


TCAATGCTAA 


CACTTGATAT 


CGATTATGCA 


840 




GCCCAAGATA 


TGACTGACAT 


ATTATCTATG 


TTTTATGATT 


TTGCATATTG 


TTTATATTCA 


900 


30 


ACACATAAGC 


ATAGAGAGAT 


AAGTCCAAGA 


CTGCGTTTAG 


TGATTCCTTT 


AAAACGAAAT 


960 




GTAAATGCAG 


ATGAGTATGA 


AGCTATTGGG 


CGTAAAGTCG 


CAGATATCGT 


TGGCATGGAT 


1020 




TACTTCGATG 


ATACAACTTA 


TCAACCACAT 


AGGTTAATGT 


ATTGGCCTTC 


AACtAGTAAC 


1080 


35 


GATGCGGAAT 


TTTTCTTTAC 


CTATGAAGAT 


TTACCTTTGT 


TAGACCCAGA 


TAAAATATTA 


1140 




AATGAATATG 


TTGATTGGAC 


TGACACATTA 


GAATGGCCAA 


CGTCTTCAAG 


GGAAGAGAGT 


1200 


40 


AAGACTAAAA 


GATTAGCAGA 


TAAGCAAGGC 


GACCCAGJVAG 


AAAAGCCGGG 


AATTGTTGGT 


1260 


GCATTTTGTA 


GAGCCTATAC 


GATAOAAGAA 


GCTATAGAAA 


CrrXTATTCC 


TGATTTATAC 


1320 




GAAAAACATT 


CTACTAACCG 


TTATACCTAT 


CATGAAGGTT 


CAACTGCAGG 


TGGATTGGTG 


1380 


45 


TTATACGAAA 


ATAACAAGTT 


TGCCTATTCT 


CATCATAATA 


CGGATCCCGT 


AAGCGGTATG 


1440 




CTTGTGAACA 


GTTTTGATTT 


AGTACGCATA 


CACTTATATG 


GTGCTCAAGA 


TGAAGAAACT 


1500 




AAAACAGATA 


CTCCGGTTAA 


TCGACTACCT 


AGTTATAAAG 


CAATGCAGCa 


AAGAGCGCAA 


1560 


50 


AATGATGAGG 


TTGTTAAAAA 


GCAATTAATT 


AATGACAAAA 


TGTCTGATGC 


AATGCAGGaT 


1620 




TTCGATGAAn 


GAGAAAATAG 


CGATGATGCA 


TGGTCTGAGA 


CGTTnGAAAT 


TACTTCGAAA 


1680 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiCXSY: linear 



5640 
5700 
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TTTTCATTAT TCCATTTACA ATTCATATAG GATATATCGT CTATAAAGAT GAATTGAATC 5400 

AGGAAAATGT AAAAAATCAT TTCATGTGGA TAATTGTGAG TTATGGTATA AGTTACTTAA 5460 

TTACACAAAT TGCATTGTAT GGCAGAATTG ATGCTAATGA AATAGAGTCA ATTGATATCT 552 0 

TAAGTGTCAA TGCTTTCTTT ATAATTATGT GGTTACTTGG TCAAATGGCT ATTTGGAATT 5580 
TCITGrrCTT GCGCCGAGCT TTACCTTTAA CAAAGCAAGA ATTAGGTGAA GAGGAGCCAG 
AATTATCAAG AACAAGTAAA GGGAATGTCA CGAATCAAAC TAAAATTCAC TTGAAACAAC 

TCCAAGATAA GACTACAGAA TATGCACGTA AGACAAGAAG AAGTGTCGAT TTAGATAAAA 5760 

TTAGAGCTAA AAGAGATAAA TTCAAAAAGA AAGTTAATGA TATTATCGAT ATTCAAGAAG 582 0 

ACGATATTCC TGATTGGATG AGAAAACCGA AATGGGTTAA ACCAATGTAT GTCGAACTAT 5880 

TTTGTGGTGT CGTCATCTTT TTATTCACAT TTTTAGAATT TAATAATCGT AATGCATTAT 594 0 

20 TTGTATCTGG TGATTGGAAA TTATCACAGA CACAATATGT TATTGAATGG GTTACATTAT 6000 

TAATTCTGTT ATTCATTATT ATCGCATATA TCGCTACAAC GTTAACrrrc CACTTGAAAG 6 06 0 
GTAAGTTTTA TTATTTACAA TTATTTATGG GGAGCATTTT ATTCTTTAAA TTGTTAACGG 
AATTTATAAA TATAATGATT CATGGACTAT TACTTTCAGT GTTCATTACG CCAACATTAC 

TATTAATGTT ATTGGCAATC ATCATTTCTT ATTCGTTACA ATTACGAGAG CGACCATAAT 624 0 

TAAAAGCATT ATAAAAGTAC TATCTATTAA ACATTTTGAT GTGTACGCTA TAAGTTAGAT 63 0 0 

ATATCTCTAA CTTACTTAGA TACAGGTCAA TGAAGTTTAT GGATAGTACT TTTTTTGTAA 636 0 

CTAGATTTGA TTGATTCAGG TGATGTGAAT TAAGTATTGA rAATTGTATA CAAAGTTTTA 6420 

GTGCAAATAA AATAGTTGAA AAGTTATCCA TTTGTAAAAT CAAGAAAACT AGTAAATAGT 64 80' 

TGAAGCGACT TATGGaATTT GCGAAACGAT ATATAGTATT TCCTTTGTAG AAATTTmACA 654 0 

TATATCATTC AAATTACTAA TTTGTTAAAA TCAACAGTAA GATTAGAAGT AGATGATATT 6600 

GAAATTTGGC AAACAaTtTA ATCTATATAA AACTACAACG AAACACAGAA AGGAAGTTGT 6660 

CAGATGAAAA TAGCAACTCT GT^CAAAGGC AAAGAAACAA AATATTTTAA TGGATATCCT 6720 

TTAATTGAAG AAGAGGATAT CTATTCACAA GATCATTTAA AAGAAGGAGA TATTTTTCAA 6780 
45 ATTGTGACTG ATAAATCACA ATAT 

(2) INFORMATION FOR SEQ ID NO: 366: 



6120 
6180 



6804 



55 



WSOOCID: <EP 078651 9A2_I_> 



1258 



EP0 786 519 A2 





ATTTTGTTAC 


GCCTGTCTTA 


TCTAAAGCTT 


TTACTAAAAG 


GTAAGATAGG ACGCCGGCGA 


3600 




CTACTGCACC 


ACTAATTAAT 


CTAAATACGA 


TGAATAATGT 


TAAGTTCCAA CCTGCAACTT 


3660 


5 


CATTTAAATA 


ACCATAGAAA 


TAATCTATCG 


GGAAAGCCGC 


GATTGCTGTA CAAAAACCTG 


3720 




CTAACATAGC 


TACCATAACT 


GAACGTGATT 


GATATTTAAA 


AATTGCAAAG ACAAGTTCAC 


3780 


10 


ACGCTAAACC 


TTGTATAAAA 


GCGTAAACGA 


TTGTCGGAAT 


ATCGAAACGA CCCATAATGA 


3840 


TAGTTTCGCC 


GGCACCTGCA 


GCAAATTCAG 


CCAGTAAAGC 


AATACCTGGT TTTGGAATAA 


3900 




TTAGATAGCA 


GACAATCGCT 


GCCATGAACC 


AAACCCCGTT 


TGTTAATTGT TCGAGGTGAA 


3960 


IS 


GGCCTGTAGC 


TTGCACACCA 


TTGTAAACAA 


ACCACCATAA 


ATTGTAAATA ACTGCGAATA 


4020 




CTACTGAAAT 


AAGTACGGTT 


ACTAGTATTT 


CAGATAGCTT 


TAAACCTTTT GACATTTTTA 


4080 




CATCCTCCTA 


ATAAAAAAAC 


GCACTVACCAT 


CCATAGGAAA 


GTTATGCGTT CACAATATAT 


4140 


20 


ATTAGTAAAA 


CATATGTATA 


GTAACACTTT 


CCTACGCTAG 


TTCAAGCTAG ATCAGGTTCA 


4200 




AAGGGTTTGA 


GGGCAAGCCT 


CATCTCAGTA 


TAAAACACCC 


CTAGTGTGTG CGATTTATTT 


4260 




AATTAATTAT 


ACTGTAAGAC 


GTTTGTAAAC 


TTATGTCAAT 


AGGTTGTCTT CATGAAATTT 


4320 


25 


OGTTTAATTC 


GATTTAAAAT 


TTATAATATT 


AGCATTGGAT 


TTAAATTGAA GATGTAGTAG 


4380 




GAATGTTAGT 


AATTAAAGAT 


ATAAAAATAT 


GTGACATGTA 


ATAATATTGA GCTGATAAAT 


4440 




GAAGAGGGAT 


ACTTATCAAT 


CATACCTCTT 


TAACAACAGT 


GAAGAACCCG TGCATAATGG 


4500 


30 


CTTACGAATT 


ATAGTTTATA 


AGGAAGAAGA 


GGGATACATG 


CGCCGAGCAC ATGCATAAAA 


4560 




GCCCCTAACA 


ACTAAAAGTT 


GTAAGGAAGG 


AGAGGGATAC 


ATGCGCCGAG CACATGCATA 


4620 


35 


AAATCCCCTA 


ACAACTAAAA 


GTTGTAAGGG 


GATTTAAATT 


AATTTAGTGT ATCTTGGATA 


4680 


TCTTGTTTTG 


kTTGaTTAAT 


ATCTTCTGTT 


TTTTCTTCTT 


TTTTATCTTT TAATTTTTCT 


4740 




TCAACTTCTT 


TAGCTTTTTC 


TGCTGCTTTT 


TTATTTTGAT 


TTTCATTAGA CATGATTAAT 


4800 


40 


TCCTCCCAAA 


TTGGATAATT 


ATTTATATAT 


AAATCTTACC 


CGGTTGTACT TTCGTTAAAC 


4860 


TTTTCTAAGT 


CTATAGCACT 


ATTTATTCAT 


TTATCTAAAG 


ACAACAACAT TAGATTAATA 


4920 




TATAATGATT 


TTGAGGTGAA 


CATAATGTCT 


TTTCTTAGGA 


AACACGCCGA AATTATTTTT 


4980 


45 


AGCTATTTAA 


TCGGTtnwCGT 


TcACTCTTCA 


CTGGkcTCAT 


TATTTTAATT AACTTGCCAT 


5040 




TAATTAAACA 


ATTAAATGGT 


GGTAAAAAAG 


TTGATACACA 


TGTTCATAAT GTGTGGGAAT 


5100 




TTCTGAATGC 


ATTTTTCAGT 


GAAATTATTA 


AAGTAATGAG 


TCGATTTATA GGTAATTTCC 


5160 


50 


CtATAGTTAG 


TGCAATTGTG 


ATAATTATAT 


TCGGTATTTT 


AGTTATGTTG ATTGGTCATA 


5220 




CATTACTTAG 


AACTATTAAG 


TATGACTATG 


ATATTTCTAT CTTTTTCTTA GTTATCGGTA 


5280 
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ATGAAATAGC AATGGTTCTT AATGATACAT GTAAACCACG AACTAAACTT TCTCTTCTAA 






TATGGATAAA TCCGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATCAACA 


•1- D O U 


5 


GGGAAGAGAG TAATGCAAAG CCAATACTTA TAGTTATAAA AATTGCTGTA ATTrTAAACT 






GAGTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTGATATAA AGCATAAAAT 


1960 


10 


CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGTTTCX3TTA 


204 0 


TAATGTTGAC ATCATCAACA AATGATTGCC GAACTTTCCA TTGCTCATAC ATTCGTATCA 


2100 




CCATCACAAT CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 


2160 


IS 


ATTATTTCAT CGTCATGTGT AaCCATGACA ATACTTTGTC CCAAATTAAT TCGCTTTTGG 


2220 




AAAAGTTTGA TCAACTGGAA TGTATTATGG CTATCAAGTC CAAATGTCGG TTCATCTAAA 


2280 




AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAgcTACGC TAAGGCGTCG TTTTTGACCA 


2340 


20 


ATAGACAACT CATAAGGATG TTGATCTTTT ACATTTTGTA AATCTAAAAG TTTTAAAAGT 


2400 




TGTATCGTTT CATCATCACT TTGATCTTTA GAAAGGTGAT TAAAATGAAT GTrAATTTCA 


2460 




TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTTTT GATAAACTAG GTACATGTGT 


2520 


25 


TTTGCTGCAT GTTTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 


2580 




TTAATCAATT GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTGTA 


2640 




ATTGTAATCC ACTCACCTAG ACCAATTTCT AAATCTGAGA ATGAGAGCAA TGTTGATTTA 


2700 


30 


CCGCGAATAA TACGTCCATT TTTAAATTGT AATAAGTGTG AGTTTGTTGT TGGAAAGTCA 


2760^^ 




ACACGACTTG GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 


2 82 0^ 


35 


TGAACATACT TCTGTAATAT GATTTCAGGA CATTCATCGG CAATGATATT TCCGTTATAA 


2 8 8 0'^ 


TCCATCAAAA TGACGCGGTC GACATGATTC CAGATGTGTT TAACTTTATG TTCAACGATT 


2 940 




ACAACCGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 


3000 


40 


ACATCTAACA TTGCTGTCGG TTCATCTAAA AACAATGTTT TTGATTGTTG AAGAATGGTT 


3060 




TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTTTGATATA CGTTTCAGGG 


3120 




GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACGCATCCAT GTCTTCACGT 


3180 


45 


GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTOGT ATACTTTTGG CATACAAAAC 


3240 




TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 


3300 




TTCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AAGTGTACTT 


J J ou 


50 


TTACCGCAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCTT TGTCTTGAAT AGTGATATTT 


3420 




AAATGATCGA AAAll'lTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACTTIT 


3480 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

TGATGGATTA GCAGACATTT TACGAGCGAA TGGTTTCAAA GTGTTTGGTC CAAATAAGCA 60 

^ AGCAGCTCAA ATCGAAGGCT CAAAATTATT TGCTAAAAAG ATAATGGAAA AATATAATAT 120 

TCCAACTGCr GATTATAAAG AAGTTGAGCG AAAAAAGGAT GCTTTAACAT ATATTGAAAA 180 

CTGTGAATTG CCCGTTGTTG TCAAGAAAGA TGGGTTAGCT GCTGGGAAAG GCGTTATTAT 24 0 

10 

TGCAGATACT ATTGAAGCAG CCAGAAGTGC TATTGAGATT ATGTATGGTG ATGAAGAAGA 300 

AGGTACTGTT GTATTTGAAA CGTTTTTAGA AGGTGAAGAG TTCTCGCTAA TGACATTTGT 360 

TAATGGTGAT TTAGCAGTAC CTTTCGACTG TATTGCACAA GATCATAAAC GCGCATTTGA 420 

IS 

TCATGATGAA GGACCAAATA CTGGTGGTAT GGGGGCTTAT TGTCCmgTAC CACATATTAG 480 

TGACGATGTT TTAAmACTTA CAAATGAAAC AATTGCACAw CCCATTGCAA AGGCAATGCT 54 0 

2^ TAATGAAGGT TATCAATTCT TCGGTGTATT ATACATTGGT GCTATTTTT^ CTAAAGATGG 600 

TCCAAAAGTA ATAGAATTTA ATGCCCGTTT TGGTGATCCT GAAGCTCAAG TATTATTAAG 660 

TCGCATGGAA AGTGATTTAA TGCAGCATAT TATTGATTTA GATGAAGGAA AACGTACTGA 72 0 

25 ATTCAAATGG AAAAATGAAT CTATTGTAGG GGTCATGTTG GCATCAAAAG GATATCCTGA 780 

TGCATATGAA AAAGGGCATA AAGTAAGTGG CTTTGATTTA AATGAAAACT ATTTTGTTAG 840 

TGGATTAAAG AAGCAAGGTG ATACCTTTGT TACTTcAGGT GGTAGAGTTA TACTTGCCAT 900 

CGGAAAAGGT GACAATGTAC AAGATGCACA GCGAGACGCA TACAAAAAAG TATCACAAAT 960 

ACAAAGTGAC CATTTATTCT ATCGTCATGA CATTGCGAAT AAAGCACTAC AACTTAAATA 1020 

AGTAAATTTA AAATACTAAG aTTAGCTATG AACGAATCTA TAACGATAGA TTTTTTCATA 1080 

35 

GCTTTTTTAG TTGTAGAGTC TAGGACATTG ATTTCTGTAC CAAATTTGTG ATTATGCATA 114 0 

TGTAATACAA AAGAGGCGCC ACAACATGTT TGGATGAACA AAATAACATG TTTGTGGCAC 1200 

CTCTTTTGTT TAGTATGGAA TAAATGGTTT TCTTTTTCTA TACAATGAAT TTCTAATTTA 1260 

40 

GTATCTATAC AATTATGGAT AAAATTTAAC CTACACGACC AAGACGAACA TCATCTATGC 1320 

CCGTGATGGG TAAGGTGATT GAACAATAAT ATGCCATAGT AATAATGGCA ATTAAAACTA 1380 

TAATAAAGAT TATATCTTTA TATGAGAAAG GTACGTTGTA ATAGTAAGTA CGAGGACCAT 1440 

CTCTAAATCC TTTCGACTCC ATCGCAACTG ATAATTGATG TGCCTTTCTA ATATTTTGGC 1500 

TTAATAGAGG TATAATTAAA TGCTTAAATC GCTTTAACCC TCTATAATTT GCCGCGTCTA 1560 

50 TCATCTGATA GCGCATTTTT AAAGATCTGC GAAgcTGTAt TAAAGAACTA ATCATTAAAG 1620 

GTATCATACG AATGGCAGCC ATGAATGCAT AAGCAACTTT TGATTTAACC TTTAAATGTT 16 80 
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AAAAAGAGCA CAACAATTTG TGGTACAAAG ACTAATACTG ATCCTAC7VCC AGCAATAATC 
CCATCTGTAA TTAAATCTTG TAAAAATGGT ATAACACCAA GATAATTCAT AATCGTCTTC 
ACACTATCTG TAAATGTACC ACCTATAAAT GCATCGAGTT GATCCGACAA AGGTGTGCCA 
ATCCATGTAA ATGTAGTTTG AAAGATCAAC CACATAATTG CTAGAAAGAT AGGCATCCCT 
ATATATTTAT GTGTTAATAT CTTGTCTATT CTAGAGCTGA AATATTGCTr ATCTTCATCT 
GGATACGTTA CCACGTCTTG CAATAACGTC TCAATATAAT GATTGCGTAT ACGCTCCATC 
TCTCGACGAA CAGATACAGC CCCTACTTGT TCAGCAACTT GATCACGTAA ACTCGACAAT 
TTATTTACAA CCTCTGAATT AAGTTCGTTT GCAATTTGCA TGTTATTTAA TAAGAATTGA 
ATCGCAATAA ACCTAGCTTG ATACTTATCA TGAGATGTCT CTGTCATTAT TATTTGACAC 
ATATTTTTAA TTGTCTCTTC AATCTTCTCA CCATAATTGA TTTTAAAATG CGGTTGATAC 
20 CCTTCCCCTA GATGCTTTAT TTCGCCAAGT AAATATTTTG TTCCTTTGCC TGTACGTGCC 

ACAACTGGAA AAATAGGTGT TTTTAACTTT TTCATCAATT TATGATAATC GATTTrTATC 
CCGCGCTTTG TAGCTACATC AATCATATTT AATCCGATGT ATATTGGTTG ATTAAGTTCT 
2S tiACAATTGTA CTGTTAATTG CATATTTCTT TTTAGTTGAC TCGCATCAAC AATGTTAATG 

ATTCCTGAAA ATGAATCGTT TAATAAATAG TCTGTCACTA CAGTTTCATC TTTAGAAATC 
GGCGATAAAT CATATGTACC TGGTAAATCA ATTAATTGTC CTACATTTTC TTTAAGTTTC 
CCTACTTTTT TCTCTACCGT TACGCCACTC CAGTTGCCTA TATATTCATA CGAACCAGTT 
AAAGCGTTAA ACAAAGATGT TTTACCAACA TTAGGATTTC CTAAAATACA ATAATTTTCC 
ATTCGTCCGG CTCCTATTCT TCTAATGCAA TAGAACAAGC ATCGCAATGT CTAATACTTA 
ACTGTTGTCC GTTTACTTCA ATAATACATG GCCCTTTAAA TAAACATTTT TGTTTAATCG 
TTATGATAGC GTCATCTGTT AACCCAAAGG CACTTAGACG ATACAACATA TTCTCATTAG 
CAATATCCAT TCGCTTTATT TTATAAGCCT TATTCATTTC ACCATTTTTA ATGTTTAACA 
TACTATTTCG CTCTCCTATT AGAAATAATA ATCATTATCA CTTAAAAATC ATAACCCTTA 
AAATTGTAGC TCGCAATACT TTATTTAAAT AATTrTCATT TTTCATGTAA AATTTGTGAC 
45 ATTGCAAAAA TGT 

(2) INFORMATION FOR SEQ ID NO: 36 5: 



30 



35 



SO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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10 



IS 



20 



25 



30 



35 



40 



45 



50 



GKCPiCPACPiG CTGTTTCAAA AATGATTGGT ACAACTGCTG GTTATGTTGG TTATGATGAC 7620 

AATTCAAATA CGTTAACTGA AAAAGTACGC CGTAATCCAT ACTCAGTCAT TCTATTTGAT 7680 

GAAATCXSAAA AAGCAAATCC ACAAATTTTA ACATTGTTAT TACAAGTAAT GGATGATGGT 7740 

AATTTGACTG ATGGTCAAGG TAATGTCATC AACTTTAAAA ATACAATTAT TATTTGTACA 7800 

TCAAATGCTG GCTTTGGCAA TGGCAATGAC GCTGAAGAAA AAGATATTAT GCACGAAATG 7860 

AAAAAATTCT TCCGCCCTGA ATTCCTTAAC CGCTTCAACG GCATCGTTGA ATTCTTACAT 7920 

TTAGATAAAG ATGCATTGCA AGATATCGTC AACTTATTAT TAGACGATGT ACAAGTTACA 7980 

TTAGACAAAA AAGGTATTAC GATGGACGTT TCTCAAGATG CGAAAGATTG GTTAATTGAA 804 0 

GAAGGCTATG ATGAAGAATT AGGTGCACGT CCATTAAGAC GTATTGTTGA ACAGCAAGTA 8100 

CGTGACAAAA TTACAGATTA CTATTTAGAT CATACAGACG TTAAACATGT GGATATAGAT 8160 

GTTGAGGATA ACGAATTAGT CGTAAAAGGT AAATAACGAC ACTTTAACAT ATCGCGCATC 8220 

AAAAATGAGC ATCAGGTCGC CCTTGCCTGT GCTCATTTTT TTAATTATTT CCCTGGAAAA 82 80 

TGATTCGCTG TGTGCTGTTC TGTTCCACAA CAATCACGAT TAATGTCACA TGTACCACAT 834 0 

TTTCCTTGTT TTGAACGCTT GAAAAATTTT ACTAGTGTAT ATAAGGCATA TCCGAAAATT 84 00 

GCTAAAAAAA TTAAAATGTT AATAATGACT GACACTTTAA CCACTCCTTA AACAAATAAA 84 60 

TGTCCGACTT GATAAAAAAT GAATGTTAAG ACATATGCAG TGACTAGAGG ATAGGCAACT 8520 

GCAAGTGCCG TCCATTTCCA TGAATAAGTC TCTTTACGGA TTGCTGCTAC TGTAGAAACA 8 58 0 

CAAGGAATAT ACAATAGTAT AAATATCATA AATGCATACG CAGATAGCGG TGTGAATTGA 864 0 
TTTTGAATCA CATTAACAAG GCCTGCATCA CCTGATGAAT AGATAATCGC CATCGAACTT 8700 
ACGATAACTT CTTTTGCTAA AAATCCTGGC AcTAACGTAG CACCTGCTTG CCATGTTCCA 8760 
AATCCXJAGCG GTTGCACTAA CATACCAAAG AAACTACCAA CCATATGTAA AAAACTTTGA 8 820 

TTGATATTCA CATTGATACC ATGTGGTCCT ACATAACTTA ATAGCCAAAT GACTACTGAG 8 880 

CCX3CCAAAAA TAAATGTACC TGCTTTACGA ACAAAGCCCT TAGCCTTTTC CCAAGTACTA 894 0 

CGCCACAACG TTTTAATGGA AGGCACACGG TATGTTGGCA ATTCCACAAT AAAGATTGCA 9000 
TTATCATTTT TTAAAATCGT CTTAGTAAGT ACTGTACTGA CTAAAAATGC CATAATAATA 906 0 

CCTAAAACAT ACAGGCTTAA TACTACTAAA GATTGATTCT CTTTGAAAAA GATACCTACG 912 0 

AACAACGCAT ACACTGGCAG TCTAGCAGAG CATGACATGA ATGGTGCAAT TAATATCGTT 9180 
GTTAAACGCT CTTTTTCATT TTCAATACTG CGCGCAGCCA TAATACTCGG TACATTACAA 924 0 

CCAAATCCGA TAATCATTGG TATAAAAGAC TTCCCGCTTA AACCGAACGA TTCCATAATA 9300 
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ATAAGTCGTC AATTACXSTAT 


ATAAACACGT 


AATACCAGCT 


ATCACTTTGC 


TGCAATATAC 


5820 


5 


AGTTACATAT CTTACTACAC 


GTGCTAACCT 


CTTACTTTGT 


AAACCAAATC 


TTAAATTAAA 


5880 




ATATTGAAAA 


TGCAATGAAT 


CCTTAATATT 


TTATTAAACC 


TATAATTACT 


TATTAAAAAT 


5940 




AACACACAAT 


ATTCATAAAG 


TTTTAAAAAT 


ATTCTGTTTT 


ATCACCTACT 


ATTAGTGGAA 


6000 


10 


AAGTACAATT 


GCAATTGTAT 


ATAGTTTGCA 


TAACGCTTCA 


AAAGTAATTT 


CTTTTTTGTT 


6060 




TAGTTCAAAA 


AAATTTAGAG 


GTGATGTTAT 


ATGAATAACG 


GTTTTTTCAA 


TAGCGACTTT 


6120 




GATTCAATTT 


TTCGAAGAAT 


GATGAAAGAT 


ATGCAAGGTT 


CAAATCAAGT 


CGGAAACAAA 


6180 


IS 


AAGTACTATA 


TTAATGGTAA 


AGAAGTTTCA 


CCTGAAGAAC 


TAGCGCAACT 


CACACAACAA 


6240 




GGTGGCAATC 


ACTCTGCTGA 


ACAAAGTGCG 


CAAgcTTTTC AACAAGCAGC ACAAAGACAA 


6300 




CAAGGGCAAC 


AAGGTGGCAA 


CGGCAATTAT 


TTAGAACAAA 


TTGGTCGTAA 


CCTTACGCAA 


6360 


20 


GAAGCACGTG 


ACGGTTTATT 


AGATCCAGTC 


ATTGGTCGTG ATAAAGAAAT 


TCAAGAAACT 


€420 




GCTGAAGTTT 


TAAGTAGACG 


AACTAAAAAC 


AATCCTATAT 


TAGTTGGAGA 


AGCTGGTGTT 


6480 


25 


GGTAAAACTG 


CGATTGTTGA 


AGGTTTAGCA 


CAGGCAATCG 


TTGAAGGAAA 


TGTACCAGCA 


6540 


GCAATCAAAG 


ACAAAGAAAT 


TATTTCTGTA 


GACATTTCAT 


CATTAGAAGC 


TGGAACGCAA 


6600 




TATCGTGGTG 


CTTTTGAAGA 


AAATATTCAA 


AAATTAATCG 


AAGGTGTTAA 


ATCTTCACAA 


6660 


30 


AATGCCGTAC 


TATTCTTTGA 


TGAAATCCAT 


CAAATTATCG 


GTTCAGGTGC 


CACAGGAAGT 


6720 




GATTCAGGTA 


GCAAAGGGTT 


ATCTGATATT 


TTGAAACCTG 


CATTAAGTCG 


TGGTGAGATT 


6780 




TCTATTATTG 


GTGCAACAAC 


ACAAGATGAA 


TATCGAAACA 


ATATTCTTAA 


AGATGCTGCA 


6840 


35 


TTAACGCGCA 


GATTTAATGA 


AGTGCTTGTT 


AATGAACCAA 


GCGCTAAAGA 


TACTGTTGAA 


6900 




ATTTTAAAAG 


GTATTCGCGA 


AAAATTCX3AA 


GAACACCATC 


AAGTAAAATT 


ACCAGATGAC 


6960 




GTATTAAAAG 


CATGTGTTGA 


CTTATCAATT 


CAATATATTC 


CACAACGATT 


ATTACCAGAT 


7020 


40 


AAAGCAATCG 


ATGTGTTAGA 


TATTACAGCA 


GCACATTTAT 


CTGCGCAAAG 


TCCAGCTGTC 


7080 




GATAAAGTTG 


AAACTGAAAA 


ACGAATTTCT 


GAATTAGAAA 


ATGATAAACG 


TAAAGCAGTA 


7140 




AGTGCTGAAG 


AATATAAAAA 


AGCTGACGAC 


ATTCAAAATG 


AAATCAAATC 


ATTACAAGAT 


7200 


45 


AAATTAGAAA 


ATAGTAATGG 


TGAACATACT 


GCTGTTGCTA 


CAGTTCATGA 


TATTTCAGAT 


7260 




ACTATTCAAC 


GATTAACTGG 


TATTCCAGTT 


TCTCAAATGG 


ATGATAACGA 


TATTGAACGT 


7320 


50 


TTAAAAAATA 


TTTCTAATCG 


TTTAAGAAGT 


AAAATCATAG 


GTCAAGATCA 


AGCTGTAGAA 


7380 


ATGGTTTCAC 


GTGCAATTCG 


CCGTAATCGT 


GCTGGGTTTG 


ATGACGGCAA 


CCGTCCAATT 


7440 




GGCAGTTTCC 


TATTTGTTGG 


CCCTACTGGT 


GTTGGTAAAA 


CAGAGCTTGC 


TAAACAATTA 


7500 
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CTGTAAACCA AGACATCGTT TCAATGGGCG CTAACGCTGC TAAGGACATT ATAACAGACG 4020 

AAGACAAAAA GAAAATTGGT ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 4080 

AAGCAGCCGC TGTTCAAATT CACAACTTAT TAGGTATTCA ACCTTTTGCA CGCTGCTTTG 4140 

AAATGAAAGA AGCTTGTTAT GCTGCAACAC CAGCAATTCA ATTAGCTAAA GATTATTTAG 4200 

CAACTAGACC GAATCAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATCGATTGA 426 0 

ATTCAGGCGG CGAGCCAACA CAAGGTGcTG GCGCAGTTGC GATGGTTATT GCACATAATC 4320 

CAAGCATTTT GGCATTAAAT GAAGATGCTG TTGCTTACAC TGAAGACGTT TATGATTTCT 4380 

GGCGTCCAAC TGGACATAAA TATCCATTAG TTGATGGTGC ATTATCTAAA GATGCTTATA 444 0 

TCCGCTCATT CCAACAAAGC TGGAATGAAT ACGCAAAACG TCAAGGTAAG TCGCTAGCTG 4500 

ACTTCX3CATC TCTATGCTTC CATGTTCCAT TTACAAAAAT GGGTAAAAAG GCATTAGAGT 45S0 

20 CAATCATTGA TAACGCTGAT GAAACAACTC AAGAGCGTTT ACGTTCAGGA TATGAAGATG 4620 

CTCTAGATTA TAACCGTTAT GTCGGTAATA TTTATACTGC ATCATTATAT TTAAGCCTAA 4680 

TATCATTACT TCAAAATCGA GATTTACAAG CTGGTGAAAC AATCGGTTTA TTCAGTTATG 4 74 0 

GCTCAGGTTC AGTTGGTCAA TTTTATAGTG CGACATTAGT TGAAGGCTAC AAAGATCATT 4800 

TAGATCAAGC TGGACATAAA GCATTATTAA ATAACCGTAC TGAAGTATCT GTTGATGCAT 4860 

ATGAAACATT CTTCAAACGT TTTGATGACG TTGAATTTGA CGAAGAACAA GATGCTGTTC 4920 

ATGAAGATCG TCATATTTTC TACTTATCAA ATATTGAAAA TAACGTTCGT GAATATCACA 4980 

GACCAGAGTA GTCGGTX3TAT TTAAAACACA TATAATAAAA CCTAAAAGCA GCAGTAAGAC 5040 

CACTTCTAAT TGAAATCGTC TTACTGCTGT TCTCTATTTA TAACACTTCG TATTGAATGA 5100 

ATTCATTATG CCTATTTGAC ACATTATTGA AGTTTTCCTA ATGCCTGGAT CCTTTATACG 5160 

TTACGGCTTC GTCCTATGTT TTGGTACATA AAGCTTTGAC ATATCGATAT TCTCCAACTC 5220 

TAACAGCTTA ATTTTATTAT TAATCX3TTCC ACCGAACCCT GTTAAGCTAC CCGTTTTACC 5280 

GACAACACGA TGACATGGCA CGATAATAGA TAATGGATTA CTTCCGACTG CACCTCCAAC 5340 

CGCITGGGCT GACATTTTTG GCrTOTTAAG CAGCTTGCCT ACTTTTTrGG CAATAGCACC 5400 

^5 ATACGTTC1T AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA 5460 
ATGACTACCT GTTGGCTTTA AAGGTATTGT GATTTCAGGA TTGTCACCTT TAAAATACGC 
GTCTAACCAC TGTGTCGCCT CTCTAAATAT CGCTAAAGAC GTATTTTCTT CCCTAGTACC 
*° ATCACCTTGT TGATTTTCAA ACAAAACAGC GGTCAGACTT ACCCCATCAC TCAAAAGTTC 
CAATCGTCCT ACAGGCGAAT CATAGTAACT CTTATACTCC ATAAAAATTC CCCCTTTTTC 
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AAAACTTCTA TCACGAACGC 


TTCAATTTCC 


TGTATGCGTG 


TGATAGAAGT TTTTATTTTT 


2220 


5 


GTTCATATTA ATTTATCTAA 


GCGCTACGAT 


GGaCTGACTT 


AAACTTTTCT 


TTAAAATCGC 


2280 




TATTCGCCAT 


TTACTATTGT 


TGTCTAATTT 


CTTGTAAAAT 


ATGTTCCX5CT 


GCTTGTGTAT 


2340 




TTGCACGGGG 


TTCTTTTTTC 


AAAGCTTCAG 


CTACTTTAGC 


AATTTCATCA 


CCTTTTGCCC 


2400 


10 


CTACAACGAT 


AGCTAATGAT 


TTATATTGTA 


AACTCATATG 


ACCTTGTTGA 


ATACCTTCTG 


2460 




ACACAAGCGC 


GCGACATGCT 


GCAAAGTTTT 


GCGCTAAACC 


AACGGCAGCA 


ACTACATGAC 


2520 




CTAATTCTTG 


TGCTGACTCT 


ACATTTAGTA 


GCTCTAATGA 


AGCTTTAGCA 


ATTGGTAATA 


2580 


IS 


CTTTTGTACC 


ACCGCCT^CG 


ATTGCCAATG 


TCATAGGCAC 


TTCAATTGTA 


CCAATCAATC 


2640 




GTTGACGATC 


TTGATCGTAA 


CGCCATGTAG 


CAATACCACG 


ATACTGTCCG 


TCACGACTCG 


2700 




CGTATGCATG 


CGCACTTGCT 


TCTGCACCAC 


GCGTATCATT 


TCCTGTTGCT 


AAAACAACAG 


2760 


20 


CATGTATGCC 


ATTCATAACA 


CCTTTATTAT 


GTGTTGCTGC 


ACGATGT^TA 


TCTACTTGGG 


2820 




CCAATACAGA 


AGCACGTTCC 


ATTCGTTTGG 


CAACCTCTTC 


TCCAGTTCTC 


TCGCCCCTTG 


2880 


25 


CTAAATCTTT 


AACATCAATT 


TCGCCTTGAA 


CTTTAACAAC 


GGACGCTGTT 


GCATGATTGG 


2940 


ATAAAATACT 


CATTAAAATG 


TCGCTTTGCG 


GAAATTCATT 


TTTTAAAAAT 


GCAGTTATGG 


3000 




CCTCTAAAAT 


CGTATTAAGC 


ATATTAGCGC 


CCATAGCATC 


TTTCGTATCA 


ACAAATACTT 


3060 


30 


TTAAAGATAG 


TAACTGTTGC 


TCAGGAAATG 


TATCAATCGC 


TATACGTTGG 


TAACCACCAC 


3120 




CACGCGCTTT 


AATAGAAGGA 


TATGCCTCAT 


CCGCAATTTT 


ATGAATTTGC 


TTTTCTAAAG 


3180 




CTTTAATGTC 


TGCTGATAAT 


TTTTCAGTAT 


CGTCAACGCC 


ATCAAAGACG 


ATTTGACCTA 


3240 


35 


TCATAATACG 


TTCAGAAGAT 


ACCGTTTTAA 


ATCCGCCAGT 


CTGATTCACT 


AGCTTTGCAC 


3300 




CATAACTAGC 


TGCAGCGACA 


ACTGAAGGCT 


CTTCCACCAT 


CATAGGTACA ACATATGCCT 


3360 




TATCGTCCAC 


AATGATATTC 


GGTAATAATC 


CAACGGGTAA 


TGCACCTTGC 


GCGATGACAT 


3420 


40 


TTTCAATTAA 


ACTATTGGCT 


ACTTCTTCAT 


CGATTAATGG 


ATGATTCAGT 


AAAATGTCGA 


3480 




ATTGTTCTTC 


TGATAACCAT 


TGCTTATCAA 


CCAATTGTTG 


TAACTTTTCT 


TTACGAGATA 


3540 




AATGTCGAAA 


ATTCTTATCT 


AAATTTTGCA 


TGGACGTACT 


CCTTTTACTT 


CACATAATTT 


'3600 


AC 


TTAACATTTC 


AATCACTACT 


ATTTTTACCA 


CTW^TAACG 


TCATTCGTCT 


TAAAATTCAA 


3660 




TTGAATAATT 


GTCGTTTTGA 


CTTTAAAATA 


AAACAAGGTA 


AATTAAAACG 


CTTACAAGAA 


3720 


SO 


ACGACAAATc 


ATTTTTAAAT 


TTAGTATATT 


TCTTTGTATA 


AAATTAGCAT 


ATTCTGATAT 


3780 




GATACAAGTG 


TTGCTTTTAT 


AAATTTGAAA 


GGATGTAAAA 


CCTTATGACA 


ATAGGTATCG 


3840 




ATAAAATAAA 


CTrri'ACGTT 


CCAAAGTACT 


ATGTAGACAT 


GGCTAAATTA 


GCAGAAGCAC 


3900 
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CAATCGCATG CCGCAGTTAA TTATTATAGT PiPJiPiPiCCAAT GTACATGGTG GGCATTTAAA 420 

CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT AATAGATGGG GCAATGCTAA AAATTGGTAT 4 80 

TACAATGCAC GTAAATCAAA ATATGCGACT GGTCGTACAC CAAGAAAATT TGCTGTCATG 540 

CAATCAACTG CAGGATATTA TGGACATGTC GCAGTTGTTG AACAAGTATA TAAAAACGGT €00 

AGTATTAAAG TTTCAGAATA CAACTTTTAT CGCCCATTAA AATACAATAC ACGTGTACTA 660 

AGCAAAAAGG CAGCACGTAA CTTTAACTAT ATTTACTAAT CAAAAAACTT CTATCACGAA 720 

CGCTTCAATT TCCTGTATGC GTGTGATAGA AGTTTTTATT TTATGAAATT ATATTATTAC 780 

TTCTACAAAT TTCAAATTGC CGTAATTGAA CGTATATTTC TTCTTCAACT ATTATTTCAT 840 

CTTTAGCATA ATCTATATAT AAAATTTTAT GCTATTATTT AAATAATTCG CTATAACTTA 900 

ACATACGTTT TCGATATAAA CCTTGTTCTA AATCTCAATA ATTTTTTGCT GTTTTCATCG 960 

TCATTAGTTA AAAAAATAAT TTAACTGAGT TTTTGAGACC TGTTTAAATT GATGACTCAA 1020 

AACCTTTATC CCCTTTTCAC TCGGTTTAAT TGCTTTAATA TTTAACACAG TCTCATTGTA 1080 

ATTTTGCTAC TAAGTTTGAA ATATTTCGAT TCGAATGTGA AcATCATTTA TTATTACTTT 114 0 

25 CGACAGCTAG AAAAATATTG TTAAACCAAA ACTATAATTA CCACTTTTAT ATAGAACTAT 1200 

ATATAAATTT TAACTTTGAG ACAGAACTGC TAGTCAGGTT TATGAATATA TTTCTTTAGT 1260 

TTACTTGaTA TACTTATTGG TAAATCATTA TTTATTTGAA GAGCATTTAT AACAAAAAGT 1320 

ATAATCCCAG TTATAGCGAT TCCATGACCA TTTTTATCAG TCAAAGATTC ATCAACAAAA 1380 

ACTTTTTGAT AAATTTATAT ATTTGTATAA TTTATTATGG TAGATATCTA CACCCGTATC 1440 

TATAACAGCT ACAGTTATCA TTGCGTCTTA TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500 

TCATTAATAT CATGGCTATT TGTAACGTCA GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560 

TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA TACATATTTT GTATATCGCT GTTAATCTCA 1620 

TTTAATTTTT TAAATTTTTT TCTAACCTTT TTCAAAATAA AATGCTCTTT AATTCCATCT 1680 

CTTAAATACA TATAAATCTC TATTATAATC TCCATAAATG TCATATCTCC TCTCTCAAAA 1740 

AACATTATAC TCAATATATT AAAAATAAGA AAGGTTTTTT CAGAATTTTT AGTATTTTCA 1800 

45 GATATTTATT CTTAAGTATC TTATTATTAC TTCTTTTTAA CGCTAGTGGa ATAATTAATA 1860 

ATAATTGTAT TTAAATGCAA TGCTGTTGTT CTTTCATACT TACAAGCAAG TCATAAGAAA 1920 

TGAGAAATTA AATTCTTATA TGATGAATAA ATGATGACTG CAGAAGAAAT TGCTAAAAAA 1980 

GGAGTGGAGT GAGGCATTCT ACTGTTCATA TTGTTAACAA ATATCAAAAT GATAGGAACT 2040 

TGAAGAATCT AAGACTGTTA AAATCGAGAC AAGAAGAAAC TCGTTCCAAA TTTGAATCCT 2100 

55 



30 



35 
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10 



IS 



(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

ATCAAAATAT TTTAAATAAT ATTGATGGTC ACATTGTAAA TrmAATAGAA AATAAATTTG 60 

ATCAAATATT ACAAGAACCA TTAAATCCAT TAAATTATGA TACTGTCAGT GGATTAGCTG 120 

GGATAGGGAG ATATTTGCTA AATAGAGTAG ATGAGAATGA ATTTAATGTT AAAGCATTAA 180 

AAAGCATATT AGTATACTTT AAAGATATTC AATATTCTAA AAATAGCTGG GTAGTCCCAC 24 0 

AAGArAGTCA ATTTTTAGAG TCTGATAAAA ATTATTTTAC TGAAGGTAAT ATCAATCTTG 300 

GCCTTGCACA TGGaGTGCTA GGACCGATGT CTTTATTTGC ATTATGCGTG ATTAAAGGAA 360 

TTACGATTGA AAATCATCAG CACATATTAA AAGACATGTA CAAATTTATC ATGGACGAAA 420 

AA-TTTTGTAA CCACGAAAGA TGGTTGCAGC OTTACGATTT AATTTCTGAA CGTAATCATT 480 

TCAATTTTAT TCGGAATGGT TGGTGTTATG GCAATACGGG TGTAATGACG ACGTTGTTTT 54 0 

2s TAATCGGCCA AGCATTACAA GATGATGAAA TAATTAAAAT GTCTAAAAAA GTGATGCTAC 600 

AAGTAGTAAA TGATAAAGAT GAAAATTTAA TAAGTCCAAC TATTTGTCAT GGATTGTCAT 66 0 

CACAAATATT AATGTTAACA ATTATGAATT TGAATTTTGA ATTAAATGAA GTGTCTGATT 720 

ATATCACTGT ATTAATAAAT AAACTGATTT CTCATTATAA GGAAGATTAT CTGGTGAATT 780 

TTATAGACAT TAATGAAAAT AAGCAAGATG TATTTAAAAG TAGGAAAGTT GGCCTTTTAG 84 0 

AAGGTGAATT AGGGGTCATC TAACATT Qg7 
(2) INFORMATION FOR SEQ ID NO: 364: 



20 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10813 base pairs 

(B) TYPE: nucleic acid 
"^0 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



45 <5Ci) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 

TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaATCCtG 6 0 

ACATCCtTGT ATCaCTATGA CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 120 

SO AAAATCCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACTTTTA 180 

TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 24 0 

AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCGTTTATTA 300 



SS 



iNSOdaO: <£P_0786519A2J_> 



1248 



EP 0 786 519 A2 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3S2: 
TACATGnrC GGATGCTACT TTATTTAGTT TGAAGGGTGC ATTATGGACG TTAGCGCAAG 
AAGTTTATCA AGAATGGTAT TTAGGATCGA AGTTGTATGA AGATGTTGAA AAGAAAATAG 
CACGAACTAC TTTTAAGACA GGTTATATTT ATCAAGAAAT TATTXTGAGA CCAGTAGATG 
AAGTTAAGGT ACTTCTGAAT GATTTAAAAG GTGCTGGTTT CGAATTAGGT ATTGCAACAG 
20 GTCGTCCTTA TACTCAGACT GTTGTGCCAT TTGAAAATTT AGGATTGTTA CCATATTTTG 

AAGCIGATTT TATTGCAACA GCAAGTGATG TTTTAGAAGC AGAGAATATG TATCCGCAAG 
CACGACCATT AGGAAAGCCG AATCCTTTTA GTTATATCGC AGCTTTATAT GGTAATAATC 
GCGATAAATA TGAATCTTAT ATCAATAAGC AAGATAACAT TGTAAATAAA GATGACGTAT 
TTATAGTAGG CGATTCGTTA GCTGACTTAT TAAGTGCTCA AAAAATAGGT GCAACGTTTA 
TTCGAACATT AACAGGTTTA AAAGGTAAGG ATGCTGCAGG TGAGTTAGAA GCGCATCATG 
CCGACTATGT TATTAATCAT TTAGGTGAAC TTAGAGGTGT ACTAGATAAT TrGTAATTTG 
ATTGTTGrrr GACAGCATAA CTTGTAGTGA ATGATTGAAC CAAAGGTTTC ATATTGAOTT 
ACAATGAAAT TAATAATGAA AAAATGCCAA GAAGCAATGG AAGTAATCCA ATGTCTTCTT 
GGCATTTTGA ATTTACATAA ATTCTTTATC ACTGTACCGT CAATTCAGTT GTGAAAATTT 
GATTGTATTC ACCAACTTGT TTAAGTTCAT CAATTATATT GTTTGAAACA GGTTGATCAA 
CGGATAAAAT CATTAGCGCA TCTCOGCCCG CTTCAGTTCT ACCTAAAGTC ATAGATGCAA 
TGTTGATATr GTATTTACCT AACAATGCGC CAGTTrTTCC TACCATACCT GGAGTATCAT 
TATGATATGA CACAATTTGA TATTGATTTG GCTTTAAGTC TACAGAAAAA TTATTAATTC 
TAACAATTCT TGGACCGAAA CCTGTAAAGA CAGAAGCGCC AACTTTAACG GAATCGCnAT 
CGHTTGATAG TTCTACCTCT AAGTAGTTAC TAAAACCTGT CTCTGCTTTA TTATTTTCAA 
TATTTAATGT CACTTG 

50 

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 
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(A) LENGTH: 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 





(Xi) SEQUENCE DESCRIPTION: 


SEQ ID NO: 


361: 






10 


TCGTTGAGTA AAAGTCCAGA AAATTGGATG 


AGT7VAACTTG 


ATGATGGAAA 


ACATTTAACT 


60 




GAGATTAATA TACCGGGTTC ACATGATAGT 


GGCTCATTCA 


CTTTAAAGGA 


TCCAGTAAAA 


120 


IS 


TCAGTTTGGG CAAAGACTCA AGATAAAGAT 


TACCTTACCC 


AAATGAAGTC 


GGGAGTCAGG 


180 


Tl-rl-riX5ATA TTAGAGGTAG AGCAAGTGCT 


GATAATATGA 


TTTCAGTTCA 


TCACGGCATG 


240 




GTTTATTTGC ATCATGAATT AGGAAAATTT 


CTCGATGATG 


CTAAATATTA 


CTTGAGTGCT 


300 


on 


TATCCAAACG AAACAATTGT GATGTCTATG 


AAAAAGGACT 


ACGATAGCGA 


TTCTAAAGTT 


360 




ACGAAGACAT TTGAAGAAaT TTTTAGAGAA 


TATTATTATA 


ATAACCCGCA 


ATATCAGAAT 


420 




CTTTTTtACA CAGGAAGTAA TGCGAATCCT 


ACTTTAAAAG 


AAACGAAAGG 


TAAAATTGTC 


480 


2S 


CTATTCAATA GAATGGGGGG TACGTACATA 


AAAAGTGGTT 


ATGGTGCTGA 


CACGTCAGGT 


540 




ATTCAATGGG CAGACAATGC GACATTTGAA 


ACGAAAATTA 


ATAATGGTAG 


CTTAAATTTA 


600 




AAAGTACAAG ATGAGTATAA AGATTACTAT 


GATAAAAAAG 


TTGAAGCTGT 


TAAAAATTTA 


660 


30 


TTGGCTAAAG CTAAAACGGA TAGTAACAAA 


GACAATGTAT 


ATGTGAATTT 


CTTGAGTGTA 


720 




GCGTCTGGAG GCAGCGCATT TAATAGTACT 


TATAACTATG 


CATCACATAT 


AAATCCTGAA 


780 


35 


ATTGCAAAAA CGATTAAAGC AAATGGGAAA 


GCTAGAACGG 


GTTGGCTGAT 


TGTTGACTAT 


840 


GCAGGATATA CGTGGCCTGG ATATGATGaT 


ATCGTAAGTG 


AAATTATAGA 


TAGTAATAAA 


900 




TAAGGATTCA ATAATGATAT TAAGACGAGT 


ATGAAAATAG 


TTAGATTCTA 


ATTATTTTCA 


960 


40 


CTACTCGTTT TTATTTTGAA AATAAGTAAT 


AATTCAACAA 


TATTATAAAT 


TGAACAGATT 


1020 




GTTTGTGAAA TTTTTGATAA TATTAAAGTG 


AAAAAGTGTT 


ATAAATTGAT 


AAATATATGT 


1080 




AATTAACAAA AACAAATCAT TTTAAAAAGA 


AGAGAGTTGT 


AAGATGATGa 


AACGATTAAA 


1140 


45 


CAAATTAGTG TTAGGCATTA TTTTTCTGTT 


TTTAGTCATT 


AGTATCACTG 


CTGGTTGTGG 


1200 




CATAGGTAAA GAAGCGGAAG TTAAGAAAAG 


CTTTGAAAAA 


ACATTGAGTA 


TGTACCCTAT 


1260 




TAAAAATCTA GAGGATTTAT ACGATAAGGA 


AGGCTATCGT 


GATGATCAGT 


TTGATAAAAA 


1320 


50 


TGATAAAGGT ACATGGATTA TAAATTCTGA 


AATGGTTATT 


CAACCTAATA 


ATGAAGATAT 


1380 




GGTAGCTAAA GGCATGGTTC TATATATGAA 


TAGAAATACC 


AAAACAACAA 


ATGGTTACTA 


1440 




mTATGTCGAT GTGACTAAGG ACGAGGATGA 


AGGAAAACCG 


CACGACAATG 


AAAAAAGATA 


1500 



55 
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GTGGATTGGA 


TTATTATACA 


CATACAGCAT TTGAATTAAT GATGGATAAC CCTAACTATG 


960 




ATGGTGcCAT 


TACAACGCTT 


TGTGGTGGTG GCCGTTATAA TGGTTTATTA GAATTGCTAG 


1020 


5 


ATGGTCCAAG 


TGAAACAGGT 


ATTGGTTTTG CGCTAAGTAT AGAACGATTA TTGCTTGCAC 


1080 




TTGAAGAAGA 


AGGTATCGAA 


TTAGATATTG AAGAAAACTT AGATTTATTC ATTGTTACAA 


1140 




TGGGTGATCA 


AGCAGATCGA 


TATGCTGTGA AGCTATTAAA TCATTTGAGA CATAATGGTA 


1200 


10 


TTAAAGCAGA 


TAAAGACTAT 


TTACAGCGTA AAATTAAAGG ACAAATGAAA CAAGCAGACC 


1260 




GTTTAGGTGC 


CAAGTTTACA 


ATCGTTATTG GTGATCAAGA ATTAGAAAAT AATAAAATCG 


1320 


15 


ATGTTAAAAA 


TATGACAACT 


GGTGAATCTG AAACAATTGA ATTAGACGCA TTAGTCGAAT 


1380 


ATTTTAAGAA 


GTAGAGAGGG 


CGTTAAAATA TGAGTAAGAG AACAACTTAT TGTGGATTAG 


1440 




TTACTGAGGC 


ATTTTTAGGA 


CAAGAAATTA CATTAAAAGG ATGGGTTAAC AATCGTCGTG 


1500 


20 


ACCTTGGTGG 


ATtGATTTtC 


GTTGATTtAA GAGATAGAGA AGGAATTGTA CmAGTCGTGT 


1560 




TTAATCCTGC 


ATTTTCAGAA 


GAGGCaTTGA AAATTGCTGA AACAGTACGT TCTGAATATG 


1620 




TTGTAGAAGT 


TCAAGGTACA 


GTTACGAAGC GTGACCcTGA AACAGTTAAT CCTAAAATTA 


1680 


25 


AAACTGGCCA 


AGTTGAAGTA 


CAAGTTACAA ATATTAAAGT GATTAATAAA TCTGAGACAC 


1740 




CACCATTTTC 


TATAAATGAA 


GAAAATGTTA ACGTTGATGA AAATATTCGA TTAAAATACC 


1800 ' 




GTTATTTAGA 


TTTACGTCGT 


CAAGAGTTAG CGCAAACATT TAAAATGAGA CATCAAATTA 


1860 


30 


CACGTTCTAT 


TCGTCAATAT 


TTGGATGATG AAGGGTTCTT TGACATCGAA ACACCAGTAC 


192 0 




TAACX5AAGTC 


AACACCTGAG 


GGTGCACGTG ACTATTTAGT ACCATCTCGT GTTCATGATG 


1980 


35 


GTGAATTTTA 


TGCATTACCA 


CAATCACCAC AATTATTTAA GCAATTATTG ATGATTAGTG 


2040 


GATTTGACAA 


ATACTACCAA 


ATCGTAAAAT GCTTCCGTGA CGAAGATTTA CGTGCAGATC 


2100 




GTCAACCTGA 


ATTTACACAA 


GTCGATATTG AAATGAGTTT TGTAGACCAA GAAGATGTGA 


2160 


40 


TGCAAATGGG 


TGAAGAAATG 


CTTAAAAAAG TTGTTAAAGA AGTTAAAGGC GTTGAAATTA 


2220 




ATGGCGCTTT 


CCCACGCATG 


ACATATAAAG AAGCGATGCG TCGCTATGGT TCTGATAAAC 


2280 




CAGATACACG 


TTTTGAAATG 


GAATTAATTG ACGTTTCTCA ATTAGGACGT GATATGGACT 


2340 


45 


TTAAAGTATT 


TAAAGATACT 


GTTGAAAATG ATGGTGAAAT TAAAGCAATT GTCGCTAAAG 


2400 




GTGCAGCTGa 


ACAATATACT 


CGTAAAGaTA tGGGaTGCTT TAACAGAATT TGTaAACaTC 


2460 


50 


ymTGGtGCTA AgGtTAGCGT 
ATGG 


GGGGTTAAAG TTGTGGGAAG GTGGTTTTGA CAAGGTCCCA 


2520 
2524 



(2) INFORMATION FOR SEQ ID NO: 361: 
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20 



2S 



40 



45 



AATGCtTCTT TCATCTCCAT GCCCTGTTGC TCATTATTAA TAACACGGTC TATTAACACA 
ATGGCATTTG TTACTACGAT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTCAT 
ATTGTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 
AATAAAAITG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATACG 



180 
240 
300 
360 

ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTCCATTGTT AATATCATCT 4 20 

GATGCACCAC CGATATTAAC CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAGAC 
ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 
,5 GCATAATCTC CTTGTTCTTG CGTCAATTTA CTTGGTGTCG TTGITTTAAC TAACGTAGCG 

ATATCTCCCA ATTTAATCGT ACCACCAGTC GGCTTTTTCA AAG 
(2) INFORMATION FOR SEQ ID NO: 360: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 60: 
TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 
TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TT-TGCCGAGG 
GAGATGTCAA AATGATTAAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 
55 AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 
CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 
TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTTCATATAT TGAACATAAA ATGCAAGGTA 
ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 
AAAAAGGACG CTATCGTCAA TTTAATCAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 
CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 
AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 
AAGCGTTAGT GAaACACTTT GAACCAGTAA TTCATGAATT TTGTTCAGAT TGTCAATCAC 
GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACCGTGAT AAAGAAGCGA 
TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTATGAAC 



480 
540 
600 
643 



840 
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AAATTAGAGC GCAGGCAGAA ATCGCCATAG ATGAAGCGGA TGTTATTATT TTTATGGTTA 
ACXSTGCGTGA AGGATTGACA CAAAGCGATC AAATGGTCGC TCAAATTTTA TACAAATCTA 
' AAAAACCGGT CGTATTAGCG GTTAACAAAG TAGATAATAT GGAAATGCGT ACAGACGTGT 

ATGATTTCTA TTCATTAGGA TTTGGTGAAC CX3TATCCGAT ATCAGGGTCA CATGGTTTAG 
GTCTTGGTGA CTTGTTAGAT GCAGTTGTTT CTCATTTTGG TCAAGAGGAA GAAGATCCTT 
ATGATGAAGA TACAATTCGA CTATCCATTA TTGGACGACC AAACGTAGGT AAATCAAGTT 
TAGTAAATGC TATTTTAGGT GAAGATCGCG TTATCGTTTC TAATGTTGCA GGGACAACGA 
GAGACGCTAT TGATACAGAG TATAGTTATG ATGGACAAGA TTATGTTTrA ATCGATACTG 
CTGGTATGCG TAAAAAAGGA AAAGTATATG AATCAACTGA GAAATATTCA GTATTAAGAG 
CTTTAAAAGC GATTGAACGT TCAAATGTTG TTTTAGTGGT CATAGATGCA GAACAAGGCA 
TCATTGAACA AGATAAACGT GTTGCAGGAT ATGCACATGA ACAAGGTAAA GCAGTCGTGA 
TTGTCGTAAA TAAATGGGAT ACTGTGGAAA AAGATAGTAA AACGATGAAG AAATTTGAAG 
ATGAAGTACG TAAAGAATTC CaATTTTTAG ATTATGCACA AATTGCTTTT GTGTCTGCTA 
AAGWVCGCAC AAGATTACGT ACATTATTCC CTTACATCAA TGAAGCAAGT GAAAACCATA 
AAAAACGTGT TCAAAGTTCA ACTTTAAATG AAGirGTIAC TGATGCAATT TCCATGAACC 
CTACACCAAC AGACAAAGGT AGACGTTTGA ATGTCTTTTA TGCAACACAA GTTGCTATAG 
AACCACCGAC ATTrGTTGTA nTGTTAATG ATCTAGAATT AATGcATTTT TCTTATAAAC 
GCTATTTAGA GAATCAAATC CGTGCCGCTT TTGGTrTTGA AGkTACACCA ATTCATATTA 

TAGCTCGAAA GAGAAATTAA CGATTGGGGG ATAACAATGA CTAAAATTAC CGTTTTTGGT 

35 

ATCGGAAGTT TTGGGACAGC CCTTCCCAAT GTTCTTGCAG AAAATGGACA TGATGTTTTG 

ATCTGGGGTA AAAATCAAGA TGCTGTTCAT GAATTAAATA CATGTCATAC AAATAAAAAG 
40 TATTTAAAAT ACGCGAAATT A 

(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

< C ) STRANDEDNESS : doubl e 
(D) TOPOLOGY: linear 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

^r-TATf-T-rr nrccCAATTA CCAAGGGAAC TAnTGGCACC 

CTTTCGGAAA TTAGGATCnG nGCTATCTTG GC^UUAAIa^^ 



3420 
3480 
3540 
3600 
3621 



60 
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ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTTAT CGATATGACG * 


600 


5 


ACTTGAATAT GATAAAGTGA CATATTTATG TATATGACTA TTTCGCAAAA TGTAATCGAG 


660 




GTAGAATTTC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 


720 




AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 


780 


in 


ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 


O A f\ 




GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 


Q ft 




CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 


Q ^ r\ 

960 


IS 


ATGAAGAAAA TGAAACTGGA GCTTACATCT TATCTAGAAG ACAACTTX3AA ACTGAGAAGT 


n ft O ft 




CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATGAAAT CATCGAAGCG AAAGTAACAG 


1 ft O ft 




AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 


114 0 


20 


TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTr:a rrrnzir-nn ar^i^ TsT-r.^./^^n^. 


1200 




AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 


1260 


25 


AAGAAGAAAA CGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCGATGTTA 


1320 


TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCX3GTGTTG 


1380 




ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 


144 0 


30 


TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 


1500 




TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 


1 C ^ ft 




ACGAAAATGA TGTCATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTC 


T £ O ft 


35 


AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTGCACAC AAACACATTC 


1 ^ O ft 

J. b o U 




GTACXSCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA TGTTAAAATA TTAGGTATTG 


1 O^*! ft 




ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 


T Q ft ft 

X o U U 


40 


TTGAAAGTGA TCCTTCTACG ACTAAGGCGT ACTTAGAAAA CGAAGAAGAA GATAATCCAA 


1 fi c n 
X o o U 




CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 


1 Q O ft 

X y ^ U 


45 


AGTCAACTCC ACATGTTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 


1 O Q A 
X 7 o U 


GAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGITT TAAGTCCtGa 


o ft >1 ft 




tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTat-piiiva 


2100 


SO 


AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAGCT ATTGTAGGTA 


2160 




GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 


2220 




TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 


2280 
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10 



IS 



20 



30 



35 



40 



SO 



<2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 
AAAGTAAAAA TAAATCTCCC TTTTTAACTT TCGTTTCTGC CATAGCCATT GCTTCTTCTG 60 

TX3ATAGTTGC TACAATATCT TTTCTTTCAC GGTTAAAATG TTCAACTTGT TCTGCTAAAA 120 

ATGCAGCTTC TTCTTCGACG TCAGTCATCA ACAATTCGCa AGCTAATGAT GCGTCATCTA 180 

AACGACCTAC AGCATTAAGT CTAGGTCCAA TAATAAAACC AATTGTTTCT TCATCAATAT 240 

TGTCATTGTA TCCCGCTTCT TTTAGCAATG CTTTAACAGA GGTCGGACAT TGATCATTTA 300 

AGACTTTTAA TCCTTGTtTC ACTAATGATC GATTTTCATC AGTTAAGGAT ACTAAATCCG 360 

CAATGGTACC TATCGCAACT AATGCTTTAA AATAATCAGG TACATTTtCA ATCAATGCTT 420 

2S GIGCTAATTT GTATGCAACA CCTGCACCAC ACAATTGTTG GAACGGATAA TTAAACGATG 4 80 

GATGCATTGG ATGTACGATT GCATATGCTT CTGGTAATGT ACTACCAATT TCAT 534 * 
(2) INFORMATION FOR SEQ ID NO: 358: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

GGTGAGTCAA ATTAAATGAA TCTAATAAGT CATAACTATC TATTTGTAAT GTGCAACGCT ' 60 

TAACGCATAT ACAAAATGAA TGTGCTGATA ATGATTTACT CAAATTAAAA GGTGATTTTT 120 

ATTCAATGAT GAATGAAAGT TGCCTTTTTA TTTTTGGTAA AAGTTAATGC GTCAGTGAAT 180 

TGTGTAAGTT TTTCAAAAAG TAAAAAGAAA TAATAAAGGT GAATTATTAG AATTCCArAA 240 

ATAATTCATT ACATTCATAA AGCATTTTAC AAATGGTAAG AAAATGAGTG TTACAAATCT 300 

AAATATTGCA AAAGAAGCTG ATTTAGTCAC AAAAAATGTC CTATGTAATA ATTCGAGAAA 360 

GATGCACTAT ATACGGTCTT CTTACTATTC AAATGTAAAA GTTGCTTATT TGCGTGGCTT 420 

TTTCTTTTAT AAAAGTATAA AATTTTACTA TAATATATCT TGTAGAGAAC AATGAAATGA 4 80 
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ACAATTAGAC ACTGAACATT TAGCTGATTT AAATCCAGAG CGTATGATCA TTATGACAGA 
TCATGCTAAA AAAGATTCTG CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 
GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTCAC CGTGATGTTT GGGCAAGATC 
TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 
ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 



(2) INFORMATION FOR SEQ ID NO: 356: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7B7 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



180 
240 
300 
360 



>JirtVjtwW\MiU l^J^UAiALjAX 420 

CATAAAAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTrGTGCTT TCTrrTTATT 4 80 

TGTATATATT TAAATATGGC GATTGGTTCT TCGAAAATTA ATTTTAGCGA TATCATTCAC 
,5 TATGTTACTG GTCATACAGA TACGAAAGCA ACGTTTTTAT TGCATAATGT ACGTATGCCA 

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 
GCAATGAC 



540 
600 
660 
668 



60 
120 
180 
240 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 
ATACAAAAAA ACATATCGAA AATAAAGCTA AAAGAAACTA TCAAGTTCCA TATTCAATTA 
ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 
35 CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 
GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 300 
ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 360 
TACCATACAC AATTGCAGTA AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 420 
CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 480 
GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 54 0 

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 600 
TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 660 
AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 720 
ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 



45 



780 
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TTATTCGTCT TAGAAAAGTT CGGAGGTGCT TGTyGTTGAC TTGATAAACC GaAAAGCATA 2220 

CGGCTATTTG AGAATATACC ACTGTTACAT GATGAAGCAG CAGCGGTTAA TACTACAAAA 2280 

TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTTTnC GAATGGACTG 234 0 

TTATCAGGAT CAACTTGCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 24 00 

TTnnATATTA A 2411 
(2) INFORMATION FOR SEQ ID NO: 354: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 54: 

GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACCCAGA 60 

AGGTCAAAAT ATTTTTGATG CGCTAAGTCA ATATGAAGTT AAGCGACGCG GCGATATGGA 120 

AGAGGATCCA TCATATAAAC AACTCATTTC TTATTGTTTA CTTGAAAATG AGCATGGCGA 180 

GATATTAGTG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC 24 0 

AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGGAGCA GAATCTATTA ACGAAGTATT 300 

GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT TTAAGTGAGC AAGATTCACA 360 

AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGTGGGCA AGGTACATAT 420 

TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC 4 80 

TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG 54 0 

GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGTG ACATATATGT CAGATATTAT 600 
40 TCCAG 

(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



605 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
TTTATTAGCA CATCCAAACT ATTCATATGT TGGACAATTT TTAAACGAAC TAGGATTTAA 60 
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AACAACTTTA TCGTTTGCAA CGACTAAACC TGCTTTAATC CAATCTTGTA TCTGGTTACG ' 


4 O n 




AGACCAATCA TTATTTAATT CAGGCAGCAA CTTATCTACA CGCATACCTG TTrGTTCTTT 


480 


5 


ATCTGTAATG TTAAATTCAT AAGTCTCCAT TACTTAACCT CCTTCTCCTT TTTATTGGAA 


54 0 




GTATCCTTTA ATAAGGCAAT AATAATTAAT ATTACACCAA TTGTTAAACT TGAATCTGCG 


V \J\J 


10 


ATATTAAATA TTGGAAAATC ATAACCAAAA ATATTTGTAT CAATAAAGTC AACAACTTCT 


660 




CCTGTTAAAA TTCTATCAAT AAAGTrrCCA AGTGCACCTG CAAAAAGTAA ACTAATAGCA 


720 




ACTTGCATAA ACAAATTATA TTGAGCATCT TTAATAAAGA AATATACTAA GGCTATTAAT 


780 


IS 


ATAATAATGG TAATAATAAA GAAAAATGTC ATTTTTCCAC TCAATATTCC CCATGcAGCA 


840 




CCATTATTTC GATGTGaTGT TATGTTTAAA AAGTGCX3GTA TCACTTCAAA TGAATCTCCA 


900 




ATTTTCATTG TAGTAGCTAT AATATATTTA GTAACTTGGT CAAATATAAC GACAAATACT 


960 


20 


GCTATTAAAA TGGAAGTGCC AATAAAATAT TTTTTGTarA TT^nn^nn -vnnr.r.rr.r^'^ r.rr. 

— - — — — — — — -----.«.->-.w i W >w.*"M-V A. 


X 0^ 0 




CGTCCATGAG ACAACTCTTT ATATTATAGC TTACACCTGC TAATAAAAAA AGTAAGCATA 


108 0 


25 


TTACATTAAA TCTAATGTTA CTAACTCAAT ACTTGATAAA CTACTATGTT TTGACATTAA 


1140 


ATATGAACTT AATTATTCAT TTATCATATT TAAGATGACA TTAAAAATTA GGAAAGCAGG 


1200 




CTGGAACATA AATCCCTAAA AAGACAGTAG TAAGATATTT TCTAATTAAA AATTATCTTA 


1260 


30 


CTGCTGTTCT CTATTTATAC AATACTTCGT ATTGAATGGC TTCGCTATGC CCATCTGGCA 


1320 




CATTACTGTA AAATTCTATA AATAGAATTT TTGATGATGG GTCCCTTCCT AGGGTGCCGT 


.13 80 




CTCAGCCTCG GtCTTCGACT GGCACTGCTC CCTCAGGAGT CTCGCCATTA ATACTACGTA 


144 0 


35 


TTAACATGTA ATTTTACTTT TAAATACTTT AAAAAAATAA GACATGAATC GTCTACACTT 


1500 




AATTGGACAA ATTCTATGAG AATAGATATT GTTAATTTAA GAAAGTAGGC TATTTTGAGT 


1560 


40 


TCCACTCGAA TGTCAGTTCG AGGAATAAAT AAAGTTAAAC GAGAGCTAGG TTTTGTATTA 


1620 


ATGGCAATTA ATATAAGGAA AATAGCAGCT CAACGAGCTG TACATTATAA AATACATATC 


1680 




AAAAAAGCTG ATTTCTATCA AATAATTAAT AGAAATCAGC TTTTTTACAT TGCCTAAGAA 


1740 


45 


CTTAATGTCC CAAGCCCTAA AACTTGTTGT TATTTATTTG ATTTAGCAGC GATACGTTTA 


1800 




TATCTTAAGT ACATAAATGC TAAAAGTATA AACCAAATCG GAATAAAATA AATTGCACGT 


1860 




CTTGTATCAA CATTAATAAA TAATAACCCG AACACAAAAA TGAAGAATAC AAATATTAAG 


1920 


SO 


TAGCCCATAT ATTTGCCACC TAATAGTTTG TACGTAGCAT TTTTATGTAG ATCTGGGTTT 


T Q o r*i 
J- y o u 




TTACGACTAT AATTGATATA TGCAATGATA ATCAGACCCC ATACAACTAA AAATAACACT 


2040 




GTAGAGATGG TAGTCACATA CGTAAATACT TTTGTCGCAT CTGGGAAAAT ATAGTTTAGT 


2100 
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ACTTCPJ^TTT ATTTGTAAAT AGGAATAATT CTGTTTTACA TTATATAGGA GCGTTTCCTC 
TTTCGCAATC TTCGATAATA AAAAAATAGT ATACTTAATT AAATTATTGA GCX5CTTTACT 
TTATAATGGA GACAAAGATA TATCTCACGA AAGAGAATCG AGGTGTATAA ACATGTTATT 
TGTCATTTTA GTTTTATATG TTACTGGTAT TGCATTTATT CTACTCAGTG TTTTTGGTTC 
AAAGACTGAA GGATTATCTA CGAAACATAC TTTATATACC ATTGGCAGTG CTATTATAAC 
GATTGCTATT TTCATTTCAA TTGGCTATGC CATTCAATAC TTAACTGCAG CGCTTTATGG 
TTTCTAAGGT GAAGGTCATG AGTAACGGGT AGTTCGGGAG AGGTTAACTT GCGTTGATTT 
TGATAAAGTG ATCATAGCTT TTAGTACTTG AGGATTTTTA TTGTTGCTGT TACGAATGTG 
GTCATGTTTA ATGCGGGACA GTAATTTAAG TTGTTTTTTT ACAATTGAGA GTGTGATATT 
TCGATTCGGT TCGAATTACT TTACATGGGA ATAATATAAA TTAAAAAGAA GCGGCCTAGT 
GTCAGTTGTG AATATACTGA ACATTGGTCG CTTTATTTAG TAGTATGATA TGTAGTTTAG 
CTATTAATTT TTTTCAGGTC ATCCTTAATG CTGTCTATCT CAGACATGGC ACmTAACC 
CAATCTCCTT GAGCTGCACC TTTAAAATTA GCTTTAAAAg ctTCGCAATG TTGCGCCATT 
TGTTCAATTA ATACTTrTTC TTCACCTTTT AATCCGTTTT CAATATCTTT GTATTTATGC 
TTATGTTCAG GTGCAATAAC TGTGCGAATA TTTTCTTTTT GCGCTTCCAT TTTAGATATG 
AGATTAAGTG TTTCTACrGT AGTACTTATA TCTGGCATTC TTAAGGTCAT ATCTGGTTCT 
ATTAGAGTCA TTTAATCTCC TCCAAATTAT CAGTCACTTA GCTTATCTAA CTGCTTTTCA 
TAAGACTTTT TTAAGTCTTC TTTATATTCT TCTAATTTCC CATTCTTGCT TTCTGA 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



8940 

9000 

9060 

9120 

9180 

9240 

9300 
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9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9956 
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50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 
TTTCTTTTAA CAGaTTTATC cCATTAATTG TTCTACAAGC CCACGaTGAG 
ITTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA 
AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAACCAT 
ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGTTTAT 
AACATCGTCA TCTTCATAAT AAATATCTAA ATTTAAATTT TCAGGTAGAA 



CAATATCATT 
TACCTGGACG 
TAACTAATGT 
ATACAACTGC 
TATCAGCTTC 
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TTATTGATGT GAGGTGAGTC TTGTTAGTTT GTTGCAAATA AATGGTCTTC GTCTTTTTTG ' 714 c 

TATAGGACGT TCTTAGTGGG ACATACGGAA TATTCGTGAT CrTTGTAGTC TCACGCGTTA 72OC 

TATTTTTGTG GCGTGTTTTA TGTTTGATAC TCGAGTTCTG AGACATTCAT GATTTGGCAT 726C 

GCGAAATCTT AATGATnTC ATGATCTAGC GCAAGATATA TTGGCCACGT GCGGAATTGC 7320 

GTTGCACGTT TAGACTGAAA CACTCGTGTG ACCGTAAGTG TTAATAGTAC ATTGATAGCT 7380 

GCATTTACTT CACTCATTTr TATGACTGTT AAACAATGAT TGTACCTTCA ATTAACAGTT 7440 

GGTACGATGG TTTTGCCATT TTTCATCAAC GTAAATATAA AAAGGACTAA GACACATACA 7500 

,5 TGTCCTAGCC CTATGGATAA AATGCAAATT TCTGCTTTAT CAAAACTATC ACACTTTAGA 7560 

TAGATTGAAA ACAAAAAGAT CCTAAGAACA CCTTAACTTT TTATTAATTG TCATAAATTG 7620 

CAAACAATTA AGCCACAATT CAAAAATGAT TATACTTCAT TCAACTTATC GTGCTGGTCT 7680 

AATTTGCCAT TGATATGGAT CTTCAAATTG TTr;PPaATr»T' nmiTr^T^T^T-n^ r^>rnrr^r^^^^^,^„ . _ 

— ^ w . — ^— -T-~» A A A v-x J,^J^-VJ^...^A i //4U 

GACTAAGCAT GCGTCGAGTT CTTTTGTTAA TTTTTCTTCA TCTAATTCTG TACCAATAAT 7800 
GACAAATTGT GTATGACGAT CGCCATATTC TGGATCCCAT TCAGCTGCGA CATCTTGACG 7860 
TTCTGCTAAT ATTTGTGTTT GTTGCGCTTC AGACATACTA GCCACCCAAT ATX3TAACTGG 
ATGAATATTG CAAGATGACC CTGCTTGAGA TAATAAACAT GCTACGTGAT TGTATTGTGC 
30 TAGCCATACG ATACCTTTTG ATCGAACGAC ATTATTTGGC ATGCTTTCTA ACCAATCATT 
GAACCTTTTA GCATGGAAAG GTAGACGACG TTTATATACA AACGATGATA TACCATATTC 
TTCTGTTTCA GGTGTATGCG ATGCATGCCC ACCAGACTCA AGTTCTTTGA TCCATCCTGC 
TGACTCGCTC GCTTTTTCAA AATCAAAACG CTGCGTATTC AAGACTTCTT TTAAATCTAC 
TTCAGAATTT GTTGTCTTAA TAATTTTAGC AGTCGGTTGC AATGCGCTTA ACATTTTTTC 
TAACTTCGCT AGTTCTTCTT CACTAATTAA ATCAATTTTA TTAATAATCA ATACATCACA 
AAATTCAACT TGGTCAATTA ATAAATCAGC AATCGAACGC TCATCTGTTT CGTCAACGCT 
TTGATCACGA TCCATCAATA AATCTTCTGA GTTGATGTCA TGTACGAAGC GGTTAGCATC 
CACAACTGTA ACCATTGTAT CTAAACGGCA AATCGCTGTA AGATCAATGC CAAGTTCATC 
ATCAATATAT GAGAAAGTTT GTGCAACAGG TACTGGCTCT GAAATCCCTG TTGACTCAAT 
AACAATTTGA TCGATGCCAC CTTTTTTCAC TAAACGCTCA ACTTCTTTTA ATAAATCGTC 8640 
TCTAAGTGTA CAACAGATAC AACCATTAGA AAGTTCGACT AATTTTTCAT CTGTACGCGA 8700 
TAGTCCCCCA CCATCTGCGA CAAGATCTTT ATCGATATTT ACTTCACTCA TATCATTTAC 8760 
AATTACCGCG ATACGTCGAC CTTCTCGATT TTGTAAAATA TGATTTAACA ACGTTGTCTT 8820 



7920 
7980 
8040 
8100 
8160 
8220 
8280 
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AGAAATACTA GAAAGGATAA GTAATAATGA TAACGCAAAT CCATTATCAA ATATCGGCGC 
AAAACGAGTT AGAATAACAC CACCTGCATT CACAATTCCT GCATGCATAA TTGCCGATAC 
TGGCGTTCGT GCCGTTACAG ATTCAATCAA CCATCGATGA AAAGGAAATT GTGCTGCCXX5 
TATCATGACA GCTAATACAA GTAGTACATT CGTCAACAAT GACCATGTCG GATGAACTAT 
ATGTTGTGGT ACCCGCCACT CGCCAGTCGC AATATAAATA GTTACAATTG CTCCAACGAA 
TGCAAGCCAA CCACATAAAA ATGTCATCCT TGATAATTTC GCAGACTCAC GTGGCACTTT 
CCAAAAACGA TTAACGTTCA TCAGCAATGT TAAACATAAT AATGTAATAC CCCAGCAGAG 
TCCCATCAGT CTTAAGTCTT CAGACATCCA TGCTAAAGAT GCAAACGACG TAATCGCAGT 
GAACAATGGA AAGTAATGTC TATAATGATG ATCACCTAGT AAATATCGCA TTGAAAACTT 
TOAATAATA AAGCCAAGCG CCATTACAAA GCCAGCTAAT AACCAAGATA AACGATCTAT 
TTTAAATGGA CCTAAGACAT GTTGACCATG AATACCGAAA AAGCCAATGA CTGCAAATAA 
TACTGGCATG ACTAGTATGT ATAAATGTAA TTTAATATAT CTCATTGGCA TAACTGGTGC 
TAAAAACAAC AAGCCACTTA TCAATGCAAT GATAAGCGCA ATAACAAACA GTGAAAATAG 



CAATTGAAAA CTTAACACTC CATAACCTCC TTATTTCTAA TCTCTCGCAT AATTCCTTAT 
GTATAAAAAT AAAAACCTAC AATAGTAGAT TCTGTACATA ATGGCAGAAA ATTTACTATT 
GCAGGTTTCA GTTTAACTAG ACACTGCATC ACGGTACGTT GATATACCTT GTTGCAGTCT 
TCTCTTTAAG CGTGCTCCCA TGCACATATC TATATAAAAT GTTACTTCTG TCTGTTCAAT 
TCATCTTCAT AAATATGCTT TGCCTAGACG AGACCTAACG TGTTATTCGT TTTAAACTTA 
35 TAACATAAAA TATAATTAAA TTTCTGCTTC ATGTCAAATT CATGAGCTTA ACCTCTATTA 

AACCAATCAT TGTAAAGATT TTGTAAATGC ACCTGTACAG TTAGGCAGTA TTTCCCGTCC 
TTTTAAAATA AAAAATTCGC AGTTATGATC ATAACAATTC AAGTTAGGAA AAAAATCAAT 
TACGCACAAG ATAACTATGT ACAATGAAGT TAACTCATAA GCAAAGGAGG TAATCTTAAT 
GGGTATCATC GCTGGCATCA TTAAAGTTAT CAAAAGCTTA ATCGAACAAT TCACTGGTAA 
ATAAGATTTC ATAACAAACA AAGGAGGTCT TTCACATGGG TATCATTGCA GGAATCATTA 
AATTCAITAA AGGATTAATT GAGAAATTCA CTGGTAAGTA AGTTATAAAA ATCTCATAGA 
TATGAACATC TTATTTGAAG GGGGCCATTC ACATGGAATT CGTAGCAAAA TTATTCAAAT 
TCTTTAAAGA TTTACTrGGT AAATTTTTAG GTAACAACTA ATCTCAAACA TTAACGATCA 
ACAACTCATC ACTATGTTAA ATCAACATAC AGGAGGACAA AACGATGGCT ATTGTAGGTA 
CTATCATTAA AATCATCAAA GCAATTATCG ACATTTTCGC AAAATAATTT AAGCGAATTG 
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TGCTTTAATT TTTTGTTTTA ACTGAGATTC 
ATTTTTAAAT ACATTTTTAT TCATAGTTGC 
GACGTCATGT TGTAGTAATA CATCGCTAGG 
AGTTTGCTTA AACCAATTTT CCGATCTACT 
ATCACCAACT AGCAATTGTT CGPLCPJ\!C:rTPJK 
TTGTTCAAAG TGATGCTGTT GTGAACGGTA 
CAAAAGATGT CCTTCAACAT AAGCTTGGTA 
TACTGACTCT ATCGTCATTT CAGGATCATT 
TTTAGTAAAA CTATGGTCAT GTTGCGCTAA 
ACGCTTCGGC ATTGTCCAAC TCGATAGAAA 
CATTTGACGA TTCACTTGTT CGCTAAGTGG 
CATCGGACGT ACATCATAGT GATGATATGA 
TAGTAGATCA GCAACAACAT CAACATTTGA 
TAATGTTTTA ATGTTATCAA TATAAAGATT 
TTCAAGTAAC ATATCAGTAA CAAGTTGATT 
CACAGCGCTT TCTATTAATG CTTTATTTGG 
CCATTTTGCG ACATCTTCAA ACGTATCCGC 
AAAAATCGAA ATTGGTGATA ATGGTGTAAT 
GATATTTAAC TGTGTTGTCA TACCTTTCAC 
ATGACTTTCT ATCGCTTTCG AGCGTGCTTC 
CGCAAAAGCC TTAGAGTATC GATGCCGCGC 
TAAAATAACA ACACTAATGA TGACACTCAC 
ATTTTGTAAT ACAGCGTAAA AATAATTATG 
AATCAAAATC ATACCAACAA GACGTGCCAT 
CCAAGATACA AGTAATGACC ATCCTAGAAT 
GTCACTACTC ATCCAAAATA GAAATGCCAC 
CCAGCCATAA GCGTCTTTAG CAGATGCTTG 
ACCTGATTGT AAAAATAATG TTGCTTTAAA 
TGCTGAATAT ACACCCAATG CACATTGAAC 



GTATGTCATT 
TGCAAAATGA 
CATATCACTG 
TTCACAATCT 
CCGAATTGCC 
ATACAACATA 
GTCTTCCTGA 
GGGTAAGCCT 
ATGCAACCAT 
TTGATCGATA 
CTCACCTTGT 
TTCAGCCATA 
ATGATTCATA 
GATGTAGTGT 
AAAGACACTT 
GAAAATATCC 
TTCTAATCCT 
AACACGTTTC 
CTCCTATAAA 
ACCTAGATTA 
CACCCAAATA 
TGTAGGCGGC 
TGTGATGACG 
GCGTCCTTTA 
GAGTGCACTT 
GATAATAGCT 
TTTTGGAATA 
AATACCGTGC 
TAACATAAAG 



TCCCAGGCAA 
ATAAACGTTT 
TAGTAACACC 
TTAGCGACTG 
AAATAATCCG 
CCTGCCCAAC 
TCTATTGAAA 
TTAATCACTT 
GCATGGTAAA 
TAAAGTTTCG 
TCATCTATTA 
TCACGTTTTG 
TATGATGCAG 
TGCGGGATAT 
TCATCTAATT 
ACATCTCGAA 
TCCCATGGAT 
GCATTTTCAA 
TACTTCTTCA 
ACTAACCACA 
CTTAATAAAC 
GTTGTCGCAT 
TAGATAAATG 
CTAAAGGCTA 
AACACTTCAT 
AATACACGTC 
TTGAATCGCT 
AATATTAAAT 
CCCATTTGAC 



TTAGCCATAA 3540 
GAATTTCATT 3600 
ATGATGCAAC 3660 
ACTTAAACTC 3720 
TTAACAAATG 3780 
CCGGTAACGC 3 84 0 

AATGAGTTAA 3 900 

GGCGCTGTGC 3960 

AACTTTGcTC 4020 

TCCATTTAAT 4080 

TTGCATCACT 414 G 

ATTTTTCTAA 4200 

GTACGTCTTT 426 0 

TGTAGTGATG 432 0 

CACCACGTGC 4 3 80 

CATCACGTAA 4 44 0 

TTCGTGCTGC 4 500 

TGACTGAATT 4 560 

AATAATTCGG 4620 

CGTACAATAC 4 680 

TGCCAAAGAT 474 0 

GTGTTGTTAT 4 800 

TCACXJATTGC 4 860 

CCATTTGATT 4 920 

ATGCACTTCT 4 980 

CCATGACAAT 504 0 

TCACGATAGA 5100 

GAATAATCGC 5160 

TCATCGTAGA 5220 
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TTTCAGCTTC GATTTCACCT TTCGTTCPrr TCATATCACT AACTCCAATA ATATTAAAAT 
TGATTACTTC ATCTTTGTAT CGTTATCACG ACATCAACTA TTACATTAAG TTTATCATTT 
TTAGTATATT TTAAGAAGCT AGAACATTGT AGATATGATG ATATATTAGT TACTTAGCAT 
CGCAACATAT CATCGTTAAA TCCAACTTTT AAAACGCCCT TCCTCATTAA CGCTCATTAA 
ACGCAGCCAA TGATTAGACA CCTTCCTAGC GAAATGCTCA TTATTCGCGA GTAGTCTTGC 
TACAACATAG TCGGGTGCCT GAATAACGAC AAGTAAACGA ATTGGCGAAT GATACATCGT 
CCGATCAGCA GCCATAACAG ATTGCCATGA TAAGCCATAC ATCAGATCAC TCGCATTACC 
TTGCATGACA CCAACACCTG ACGTGACGGT TTGTGTCGCT TTATTCCCAC TTCCGTAAAA 
ATGCGGCGCA ACTGTCGACG CATAATATTG TAAATTAATC CATTGTGCCA CAAGTGCCGG 
ACCAGAAATG ATGGTATTTA ATAATGTGCC ATCTTTATCT TTACGCCAAT CATAATTGTG 
TAAAAATGTC CGCCCTTCTA AATCAATGCC TTTTGTrAAT TGGCGTCX3TC CAATTATAAA 
TGATGCATTT TTAGCCAATC CCCATTCTGG ACGTACCTCA CTCCAATCAC TCGCAAACCG 
CTGCGCTTCT TCCACTGGAT GATTCACACG ACCAATCX5TT GGCAGTTTGT CCAAACGTTC 
GCGATTCGcg TGtTCAGAAA TCATCGGCAT CGCGTCATTC AATGATTCAT ATGCATCTAA 
AGCAATAGAA GATAATGTCT CTGGGACATA TACCCATGCC AACGTATCAG TAGACGTATG 
ATGTTCTGCT ACCGCAAAAA CAGrTGTCTC TGGAATATAC ACACCTGATT GTTTTAATCC 
TTGTCTGACA TTTGGACGAT TACATATCAT CGCTAATAAC TTAGCATTAA AACCGCTTGA 
TCCGCCACCA CAAGCCCCAC ATTCAAGTGA TGCATGATGT GGATTATTCT GAGAATGACT 
AGCATGACCT GCTAACACAA CGAACGGCGC AAATGCTTCG GTTAAATCCA TCAATTTCAA 
CX3CTTGTAAC GCGAAATCAA TTTGCTCTTC CTCAGTAAAT CCAACAGGTA AGTCTGATGT 
TCGGTCAAAC TCACGATCAA TCGTCAACTT TGTTTCAGGC TriTTCAACC ACTTTTGTTT 
TATTTTTTGT AAAGACX3CGC GACTTrTTCT AGGCATAATC GAATTGACAA TGGTACTTAA 
GCTTAAAAAT GGCCCACTTA ATTCAGGCAA TAACAGACTA GGCATGACAT TATTTTTCAT 
CAATTTAAAT GTCTAAAACA TCGATGACAT TGTCTGTTGC TGTTGTCGAT AAACATTCAT 
ATCGTAGCGG TCTGCAAATT CTTTAATGCG ATATGCCGGC GGTACCATGA CAGGTAATGA 
ATCATGTTTG AATTGTTCGT CTACGGCATC TTTTTGAATA GGTAATCCAA AGAAGCCTGC 
AATACCAATC GTTTCAAAGG GCCCTGCTGC TTCGATATGT CTACGAAATG GTTCTGAACG 
AACATCTATA CAAAATGCAA TTTGCGCTTT CGTTGATGTG CCCATCTGAT TTAGCTCGCT 
ATTATTTTCA TCAACTGCTT GTGTGTCATT TAACAATACT GAATGTGGCT GATTAGCGTT 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
, 2S20 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



55 



•'^DOaO: <EP 0786519A2J_> 



1233 



EP 0 786 519 A2 





(xi) SEQUENCE DESCRIPTION: 


SEQ ID NO: 


352: 






5 


GCGTTGTCGT CGATGATTAA 


TAAAGTATGG 


GTATACCATT 


AAGAATAACG 


CTACCCAAAT 


60 




gAkTGCTAGT GACGTGCCGC 


CAATGACATC 


TGAAAAGTAA 


TGTGCATGAA 


AATAAAGGCG 


120 




ACAAAATAAT ATGCTAAGCC 


ATAATATTCC 


CATAACCAAC 


GCACTCAACA 


CTTTTGTTAT 


180 


10 


TGTCTTAGCA GCAAGTGAAA 


TAATAATGAT 


CATTAAGGCG 


AAATATAATA 


ATGTGCTGGC 


240 




GTTGGAATGT CCACTCGGAA 


ATGAAAAGCC 


TGTATCAACG 


GCTAAATGAT 


TATATGGTCT 


300 


to 


TGGACGTAAT ACAGTATCTT 


TAATTAATTT 


GTTCATGATG 


ACACCTGAAA 


CCAAATATGT 


360 


CACAAACCAA ACCGCTAAAT 


GCCTCTGTTT 


AATAAACAGT 


ATGATTGTGA 


CGATAATGGA 


420 




AATCAACACG ACACCTTTGA 


CATCTCCAAT 


TTCCGCACTA 


AACGTCATAT 


AGTAATTAAA 


480 


20 


CAAATTGTTA ACATACTGAC 


GTTGTGGCTC 


ACCGAAATAA 


TCTGTAAACC 


ATGTTAATGA 


540 




TCCCATATCT ATATTTTTAA 


GCCATTCTTG 


ATTTGTCACT 


ACACTGTAAA 


ACATACCTAT 


600 




AAATACAATC AGCGCGATTA 


AAAATAAAGG 


CACTGTcATT 


TTCGGTGATG 


TTAATTTTIT 


660 


25 


ATCTATCATC TTACAATCTC 


CTCGTATCAT 


CATTTTCATT 


TTACAAATGT 


TATCCATAAT 


720 




ATCAATGTGC CACAAATTTC 


ACTTTACCGA 


CAATATCAAA 


ATTATAAAGT 


TCATATTGTT 


780- 


30 


ATGTATATTG CAAATAAAAC 


ATTGTATAAT 


TGAAATAACA 


ATATTTTGCT 


ATTTTCAATT 


840 


TAGTACGATT TATATTTATT 


ATACAGAGGG 


GGTAAGGCGT 


ATCAATAGAG 


TTATTTTTGT 


900 




CTATATAGCG TTAATCATTA 


CATTAGTTAG 


ATTCTTTACC 


CCTATTCATC 


CATCATTTTC 


960 


35 


AAATTTGATT TACTGGATAT 


TTGTATTATA 


TTTTATTCCT 


ATTATACTAT 


GCGTTATCGG 


1020 




TTTCAAGGCC GAAAAACTTA 


TTGCAACAAT 


GGTCATTATA 


CCTAATTTTT 


TAGGAATACT 


1080 




TTATCGATTA TATGCCTACG 


TCACACATAT 


ACTCTTTATG 


TAAAAGGATA 


GTGGGCATGT 


1140 


40 


CTCGCAACAA ATATGCTTGC 


GTCGACTTGT 


CACCATTCGC 


AAATTTTATG 


ACATACGCCT 


1200 




TTCACGGGCT ATATTCAGAC 


CCACGCATTC 


ATCCACGTAA 


TAAACACATC 


ATGTAATAGA 


1260 




AAAACAGCAC ACCCAAATAT 


ATGGCGTTGC 


GCTGTTTAAC 


CAAGCATACT 


TCTATAGCTT 


1320 


45 


TAATAAGCCA GCAGAAGCAT 


ACCTAACCTT 


CTTAAATATG 


CTTTTCCAAA 


TTATCCTCAA 


1380 




GTTTGAATAC GATAATACGT 


TCACCTGTAA 


CTGTACTTAA 


ATCACTATGG 


AAGCTCATCA 


1440 


SO 


CTTTGATACC TGTAATTTTA 


AAAATGATAT 


CATTCAAATC 


TTGCTCACCG 


GATTCAACTA 


1500 




ATTCAGAACG TGTTCGTTTA 


ATATTTAATA 


ATCCTTCATT 


CGTACTACAT 


ACACGATATT 


1560 




CAGCTGGCGT TAAGATACCT 


TGTAAACTAA 


TAATCACCAT 


ATCTCTTAAA 


ATGTCTGATT 


1620 
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360 
420 
480 
540 
600 



720 
780 



900 
960 
1020 
1080 



TTCTyTATTT GTAaTAACGC TGCAAACATA TCCATAACCC CGCTTCTTAG AGCCCCTTTT 
AATTnATnAA TTAGGGGCTC TTATGCAGTT C3GTGCATTAG CAACCAACTG TATTCCTTTG 
* TCCCCTTTTA ATTTATTaAT TAGGGGCTCT TTTGCTGTTG GTGCATTAGC AACCAACTAC 
GTTCAATTTA ACCGAATAGT TTAAAATTAA ATACAAACCT TAAATTAGTC TAAAACTACG 
CCnTGGTTG TTCAACAAAG CTCGCCATGA GATTTACAAA AGAATCAACT TGTGGCAATT 
GCAACATGCT CGGATCATAA CTCATAAATC TCGAACGAAT CAGCGGTTCA TTATCAATTT €60 

CTAcrrrrrc aaactcaaat tgttctttgc tgatattttt catcataatt tctggcaaga 
,5 ttgtaacacc tacaccacta atcaacattt ctttgcaagt tgctacttga tccactgtaa 

TAGTTGCATG GTAATCTTGT TCTAAATTAT CGTTATACCA TTCTTTTATT TGATTTATAT 840 
AAATCGGATC AGCTTGAAAC TCTATAAATG GTAACTTTGT AACATCATCT CGTCTATTTT 
^° TTGGAMUAT AAAATAATGA TCATCATTAA ATAAATGTGT GTTAGCTAAA TTCATTACCT 
TATTTCCACG AGTTATCATA ACATGATAAT CTCTATGATT TGCTTTAATT TGTTCAGTTG 
AACCAACTTG CACTTGTATT TCAACATTAG GAAATTCGGC ATTATATAGG CTCAAAACTT 
CAGGAAGTAA GGTrTGTCCA ATCAAAGAAG AACACCCGAT TGATATTGTT CCATTCACTT 
CACCAATATG TGCCTGCATT TTCTCAAAAA ATAATCGCTC TCTTTTCAAC ATGTCACGAG 
CATCCTCAAT AATCATTGTT CCTTCAGTTG TTGTAATCAA TTGTTTTTTT GTTCTGATAA 
AAATATCTAC TCCAAAAGCA TTTTCAATAG CTTTTAGTCT TTGTGTAACA GCAGGTTGAG 
ATATATATAA AATTTCAGCC GCTTTACGTA ACGrTTTCGT TTCGTCTAAT GTTATTAGTA 
3S AACGATAGTC TTCAATCTTC ATAATTTCCC CCCATAAATT ATTCAATTAT TGAACTTTCA 
TGGCTACAAG CATTCATGAG TTCATTACTA ACGAATAATT TCACCAATTT TATTGGTATG 
GCTGCAGCTT GAATTACTTA GTTrTrCTTT TGTTGTTGGT GATTTTTAGT TTGATTATAT 
TGCTTAGGCT TTATTTGTTT GCTTTTTTCA ATATTAGTTT TATTTTGTGG CTTTTGATGA 

•rrr i -i T TGAG cctttgcatt aattttatta aagcagtaca tgattttctt ttggaatcct 

• TTAAAATCAT TTTCTAACTC TGCCATAATT TGATGTGCAA TCATATATGC TTCATGAAAT 
TGCTTTTTTG TAATTTGCTC ACTTTCTAAT GCAAACATTA AATCATCTTC ATCTACCAAC 
TCATaTcACC ACTTGG 
so (2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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20 



TTAATAATAA AGGAGCGTTA TAGTGAATAT CATCTCAACA ATTtTAATCA TAnTGTGGc' 
ATTAGAGTTT TTCTATATTA TGTACCTTGA AACGATTGCT ACAACTTCCA AAAAGACTAG 
CGAGACATIT AATATAAGCG TCGATAAATT GAAAGACAAA AATATTAACC TACTTTTGAA 
GAACCAAGGC GTATATAACG GITTAATCGG AGTTTTGCTA ATATACGGTT TGTTTATCAG 
CAGTAATCCA AAAGAAATAT GCGCAGCTAT TTTAGTGTAT ATCATTGGCG TTGCTATrTA 
TGGTGGCCTT TCAAGCAATA TTAGTATCTT TTTCAAACAA GGCACATTGC CAGTATTCGC 
ACTCATATCA ATGCTTTGGT AAGTATTGGT GTTTGGGGGG GTGGAGATGT AGTCGGAGGT 
,5 TTGGAGGATT TGAGCGAATT GTGTGTGGAC TTTAGACTCA GAGTATTTCA TCCTAATTAT 
TTCAAGCAGA GGTGACAGTA GCGTTGCCTC TGTTTCCTTA TAAAAAAATT ATTTAATGAA 
GAAAACCCAT ATCTGATTTA ATTTTCAGCT GATAAATACT CCATATATTA GAATGGCTAC 
TTTATCTATT GCATCAATCC TTTAAAACAA AAAACCCATG ATTTrr.aaAT T-r-r^^o-.-n^. 

— — ^ w w>_ w A i VJrt 

TGGGGTTCCT ACTCTCATGG ATCAGTTAAA TAAATATTAT CACTATCAGT TTATTATTTC 
AATATTATTA ACAATATATG TAGTCGTAAA AGGPAAGAGG ACATGAGAAC TTCGGTGTTG 
ATTGGCATTA CATAACGCTT CCAAACATAT TATTTGGTAA CAATAAGAAA CTATTTACAC 
AATATATTTT GTATAGTAAA ATTATTTTAT AATATTTAAA TCCAATTGCA CAAGGAGTGA 

3^ TTATCaTGGT ACCAGAAGAA AAAGGTTCTA TTACTTTGTC AAAAGAAGCA GCTATCATAT 

TTGCAATCGC AAAATTCAAA CCATTTAAGA ACAGAATTAA AAATAACCCA CAAAAAACAA 
ATCCATTTCT TAAATTACAT GAAAACaAAA AATCTTAATC ACTTTtATTT ATAGcATTTC 

^5 TAATCTCAGA AATGCTATA 

(2) INFORMATION FOR SEQ ID NO: 3 51: 

(i) SEQUENCE CHARACTERISTICS: 
^0 (A) LENGTH: 1816 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



45 



so 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 
AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 
CCAAGCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 
TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 
AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 



60 
120 
180 
240 
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GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 1320 
ATTTGATCAT GCATTTGAGT GTGTGGGTGG T 12 SI. 

5 

(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs 

(B) TYPE: nucleic acid 

^0 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

75 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 60 

GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 120 

ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 180 

TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 24 0 

GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 300 

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGrTAGA ATGCCTCATT AATTCCATTA 360 

ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411 
(2) INFORMATION FOR SEQ ID NO: 350: 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1639 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



40 



45 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

TCATTTTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTcCTTATT 60 

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACGTAT GTAAGAATGG 120 

CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 180 

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 24 0 

TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 300 

5^ GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 360 
ATGAGCCCGT TTCATTAGAG ACATTTGCAG AAATGTATGA TGAACCTAAA GGCTTTGGTG 
CATTATTGGC ATCAATGTAC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAaTCCAATk 



420 
480 
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10 



IS 



20 



25 



35 



40 



1935 



TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGATCT AAAACTGAAA TTATTCTGCT- i860 
GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA ATAGTnACAT 1920 
AAACTCX3TAA AAnCC 

(2) INFORMATION FOR SEQ ID NO: 34 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

cenmccTA aacaattttt agatttagac aacaaaccga ttttaatcca tacattagaa 

AnATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG CGACGCCACn ACAATC^^ATG 

acgcatacga aagatacact tagaaaattc aaaatttctg atgaaagaat tgaagtcatt 

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 
AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 
CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 



60 



ISO 
240 
300 
360 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 420 
GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 480 
GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 540 
ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 600 
ACAACACCTT ACGATTTAAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 660 
TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 720 
TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780 
AAGATATTAT ACTGGTAGCC GTGATGAQAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 84 0 

AATTCATGAA GGTGTTGGTG AGGTCGTATT TGACAGTAAA GGTGTGTTTA ATAAAGGTAC 900 
AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 
TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 
50 GCTAAATCAT GATAGAGCTG TACCACTACC TGATGATAIT GATTTAAGTA TTATTTCATA 

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 
AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 



55 



960 
1020 
1080 
1140 
1200 
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SO 



ACATGATAAT GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 
TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 
ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 
TAAAAATAAT GTGTGTACAA AGTCATTAAT TTAGCAAATA TTTTTATTTA GTAGTTAATA 
ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 
ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGcnC 
AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTGCATTTTA AATTATGTTT 
AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAGCTTAATG 
ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 
GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 
ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 
AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 
GATAATTGGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGGCCGA 
GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 
GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 
GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGACGATGT CGATGGCTTA 
ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 
AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 
ATTCATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATTCAGA GCTTAAACAA 
CAAGATACTA AACATGAATT AGATGTTGCG CCTATGTTCA AATCATTGAT TGGAAAAGAA 
ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 
GGCGGCAAAC AATTTAACTC TGTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 
GTrCCTGAGT ATGATAACGA CTTTATAGAA TCGrTTTTCC AAAATATCAT TAATGATGTG 
GATAGCAATA AGCTTTCACT CX3ATATTCCA AGCAATCACC GACCTGTAAC AAGCGATAAA 
AATAGCAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 
ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 
GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 
GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 
CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGGCAGT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

€00 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
TACAGTAGGA ATCATAAAAC CTAATACAAC AAATACAAAA CCATTTAAGG CATAACTAAA 
TGTGTTCCAA ATTTCATGGT AATTCATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 
GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATCAAC 
TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAATAAAAG TTAAGGTATT 
GTTATCTTTT AAACCITrAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 
AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG ATAAAITGTT CAACAGCTTG 
GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATCATACC 
TGCTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCIT TAGGTAATAA 
TTTTCCGCGT RTaaTnf3/-a/-! 

«>**-»._j.vjv_x«.v. xv»v_rt.n_f\(ji>v ViVjAUATAAAA TTGCTGCTAT 

TGCAAAAGCG GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 
GACTACAGTT GCAAACACTA GTGCCATTGA CATTAATAGT ATAGGTTTAC GATATTCTAA 
TAATTTTGTT CGAGAGACGT GGGTACCTTC CACAAAAAGT AGTGGCGCGA TAACGGCAAA 
CATAAATACT TCAGAATTGA ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 
ACCTAATGCA ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT TGATAACCGA 
ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTGCCA TACAATACTC 
CTCAAAATTT TAATAGTTAA TATTTTATCA CTTTTAAGGC ATAATGACAT AGATATATTG 
ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 
TGTTTAAATT ATTGATTGTc ATATCTGAAA TGACCGCTGT CATTTTGTCG TTGTTCATAC 
GCGAGCTTTT CAGCATTCGT TTTGTATTTT TtATAAAAGa AAAATAaAAA TATnAACCaG 
AATGGCGAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 
AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 
CTGTTTTTAT GTGCCTCAGG ATGCTTTTTC AAAT 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC 180 

ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACX5GGA ATTCCCAATT 240 

CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT CTTTTTGGTA AAAACTTCCC 300 

ATTTGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 360 

TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAATA TATTGAAAAT AAATCTTTTT 420 

ATTTTCCTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 4 80 

ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AGCTGATGCA ATTTGTCTGT 540 

CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 600 

TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TTCGATATAT ACX3AAGGTCT 660 

TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 720 

ATTATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGTCTTGA 780 

TAAACGTCTT GATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 84 0 

ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 900 

ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 960 

TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA TAACTGCTTC ACCAACTTCT 1020 

TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT TAATG7VATTC ATTCACACCG 1080 

TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 114 0 

TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 1200 

CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 1260 

TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG CTACACCATT AAAATAACTA 1320 

ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 1380 

CCGTAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT CAATCGTTTT ATAAGTTTCA 144 0 

TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA TCACACCAGC CTTGACATAA 1500 

GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 1560 

TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAAGAAC ACTTGCAGTT 1620 

CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 1676 
(2) INFORMATION FOR SEQ ID NO: 346: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT 

TCCGCTTGTT GThTTAPJKAA AGGTTTTGTA GATTGATTAT 

TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT 

TCAACCAAAA TTTTGTTCGA ATTCAATAAA TGTCTTGTTT 

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA 

TAAGAAGTCT TGATAATCAC GATCGCAAAA ATAGTTTTCA 

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA 

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA 

GAAATATCAT TTGTAATCTA TACCCAATTT ATTGCAAAAC 

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTTTG 

TTTAATTGCA TAGGGTGAAA TAATATGTAG TCCATAACTT 

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC 

ATTTAATGTA ACTAGATAAC GATGATTTGT ATTCTCATAG 

CGCTTTTTTA CCTGCACCAG GATAGCTGCA ACGATAAACT 

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA 

CTCAATTTGT AAAATCGTTT TATCATCAAT CGCGATACTA 

CATGACAGGG CCTTTTGTAA CAATAATATC ACCAGGTTGT 

AATTTCTGAA TACTTATTAT CTGCAATTGT TTTTGGTGAG 

GAATAATATA AGTAAAATTA TCGTTCGTTT AATATAGTTC 

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT 

AAACATATCT TAAATATTAA AGTATGAGAG TGTGAAATGT 

TCTGAAACAT CATTGAGACG TTCCAGACTG GATATAAAAT 

(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



TAACAATAGA 
TAATATGCCA 
TATAGTTGAG 
AAAATAGAAA 
ATTAAAATTT 
CGTGCATCTT 
CGTTCAATAT 
TTCAAAATAT 
AAAAACTAAT 
TATGGATCTA 
TTAACTGATT 
GTAAATTTTT 
TTTTTCTTAA 
TTCATCCAAT 
GTTGGTTTGT 
CAATGACCAA 
AATTGGAATT 
TTTATTGGGG 
ACTTA7VAAGC 
TTAATAAGCT 
CTATTAAGAA 
GAATTTCATT 



CAATTGCTTT 
TTCACTCATG 
ACATTCATGT 
TATTGTAAAT 
GGCAGTTAGT 
TAGCATCGCC 
ATAATTTGTA 
TGGGCCTCCT 
TTAACTATTT 
TAAAATTATC 
TTTCACTTAC 
TGCTGTTCAA 
CCCATTCAGC 
CATTTTTGCC 
CGCCAGGCCC 
AAAATCscCA 
TGTCATCTTG 
ATACGACAAC 
TCCTTGTTGA 
TGtAAGACTA 
TnAAAAACAG 
TATAGCACA 



2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3719 



50 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 5: 
TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 
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ATCGTTTCAA TATTACTTAT AGGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACGCA 
AAGAGGTTTT TCTATTTCXSC TACTAGTTGT TTAGTGTTAa CTTTAGTTGT AGTTTCAAGT 
CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 
GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 
ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 
AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 
GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 
GAGTTTGACA AAGGTCAAAG AACTGATAAA TATGGACGTG GCTTAGCX3TA TATTTATGCT 
GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 
TATAAACCTA ACAATACACA TGAACAACTT TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 
GAGAAATTAA ATATTTGGAG CGAAGACAAC GCTGATTCAG GTCAATAATG CTCATTGTAA 
AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTTAGA TCACGATATG ATTTATTATC 
AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 
AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATGTG TATTATCACA 
GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 
AGAGCGTGAA GAAAAATGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 
TGTTACAAAA TATTTAGTGC AAAAGCACGA CGGAGGTATT CAATATGaAT AACGGTACAG 
TTAAATGGTT TAATGCAGAA AAAGGTTTTG GT-TTCATCGa AAGAGAAGAT GGTAGCGACG 
TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 
AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCX3AGCA AGCTGCAAAC GTAGTTAAAA 
TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTTTTTTAT GCTTTAAATC 
GATAACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 
GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 
TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 
TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 
TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 
AATCAACTTT TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTTT 
TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 
ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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AiiAGTTGTC ATTACTCCTG 


CAGCAGGATA 


TGTAACATAT 


CTTAGTGCAA 


CAATAATGGC 


2400 


xiiAAXAGGA GGTATCTGTT 


GTTATATTGT 


CATTAATTAC 


ATCAAGGTAA 


AACTAAAATA 


2460 


n-Ai\iATGCA TTAGATGCAT 


TTGGTATTCA 


TGGTGTTGGT 


GGTATTATTG 


GTGCTGTTTT 


2520 


AAUACjUAGTT ttccaaagta 


AAAAAGCCAA 


TCCTGACATT 


GAGAATGGCT 


TTATTTATAC 


2580 


Avj*^ i\aAi_ATA CATATTATAC 


TTGTACAAAT 


ATTATGTGTA 


ACAGCAGTTG 


TAATTTTTAG 


2640 


A 1 1 («A. i ACGTTTATTA 


TTGCGAAAGT 


AATTAAATTa 


ATTACACCAT 


TATCTGTTAC 


2700 


(jVoAAC-AAGAA acgaatatag 


GATTAGACAA 


GATTGTTCaC 


GGTGAACATG 


CTTACTTTGA 


2760 


MUrtj i UAij 1. 1 A aatagattca 


ATAAACATAT 


TCGATATTAG 


AATATATTTA 


CATAGAATAT 


2820 


TCATTGTCCT GACATTTAAC 


TAAAGGTTGA 


TGTTGGGACA 


TTTTGTTATA 


CAAAAGTTTT 


2880 


ATTTTGAAAT Ci^iTiTATGA 


AAGAAGCAGA 


AATATTATTT 


AAAGCGGTTA 


CACATATGCT 


2940 


AAAATAAGGC TAAGTGTCAC 


AAATAATGAT 


AGGTGAATAA 


GT ATGAAAAA 




3000 


ATTGCCAAAT TGGCAGGCGT 


TTCAAAAAGT 


ACAGTATCTA 


GATTTTTAAA 


TAATGGATCT 


3060 


GTCAGTAAAA AAACAAGTGA 


AAAATTAACA 


AGAATTATAG 


CAGAACATGA 


CTATCAACCG 


3120 


AATCAATTTG CTCAAAGTTT 


AAGAGCGAGA 








3150 


(2) INFORMATION FOR SEQ ID NO: 344: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



35 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 





GTTATAGTGA AATTGACTCA 


TCACATTTCA 


CAGACCGTGA 


CAAACGCGTT ATTAGACGTG 


60 


40 


ATCATGTTAA 


AGAAGCACAA 


AGCTTAGTAG 


AGAACTATAA 


AGATACACAA AGTGCTGATG 


120 




CTAGGATGAA AGCCAAACAA 


AAAGTTAACA 


CATTAAGCAA 


ACCGCATCAA AACTATTTCA 


180 


45 


ATAAACAAAT 


TGATAAGGTT 


TATAATGGAT 


TACAACGCTA 


ATCCAAAGTA AATTATAAGT 


240 




TATACATCTC 


GTi"rrrAAAT 


GACAATTTAT 


CCCCGTAAAT 


ATTATAAATA ATCTTTTCAA 


300 




ATTCCACATA 


GATATAGAGA 


CACTAATAAA 


CCTCTTTGTC 


TCGATATGAT AGTCTGCAAC 


360 


50 


GATTCATGTT 


GTAGGCTTTT 


TAATTTTACA 


AATAAGGCTA 


AATATATAAG TTCTGGCACC 


420 




TAAAATATAG 


AAAATACATA 


AAAGTAAGTA 


TAGTTATTTT 


ATTATAATTA TTAAATTTTT 


480 




ATTAATTAAT 


TGTAAAAATG 


TATAATTATA 


ATTAATTAAC 


GTTTAATATT AAAATTAACT 


54 0 
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AGGTATTAGA TATTACCTTT TCGAAAAACG ATACCATCCA TTTATTGCAG CAATTGTAAT 
TGGACTTATC GCACTCTTAG CTTGGCCAAT GAGTGCATCA ACTGGAAGAA ATGACGGTTT 
AGGTATAACA ACGCCTTCAG CAAATTTAGT ACACTTTTTG ATTACAGGTG AAACTAAATT 
TATTGATTGG GGTGTCTTTT TAGTTCTAGG AATTTTCATT GGTTCATATA TTGCAGCTAG 
AGGATCAAGA GAATTTAAAT GGCGATTGCC AGACAAGATT ACAATACGAA ACAGTGCCAT 
TGGTGGCATA TGTATGGGAT TTGGTGCGTC AGTTGCTGGT GGTTGTTCTA TCGGTAACGG 
TTTGGTTGAA ACGGCAACGA TGACTTGGCA AGGATGGATT GCGCTAgCAT gCGATGATAG 
TTGGTGTATG GACAATGAGT CATTTTATCT TTGTTCX3TCC AATGAAAAAA GTACACCAAC 
AATCTGCAAA GGTTAAACAG CAAACGCAAA TAGTATAGAA GATTATTATG CAAATGATGT 
TGATCAAATA AAAGTGATTG GAAAAGGAGA AATAATTATG ATACACGAAT TAGGTACAGT 
AGGAATGGTA TGTCCATTTC CGTTAATTGA AGCGCAAAAG AAAATGGCAA CATTGCAATC 
TGGAGATGAA TTAAAAATTG ATTTTGATTG CACGCAAGCG ACGGAAGCCA TTCCAAATTG 
GGCTGCAGAA AATGGTTATC CTGTAACAAA CTATGAACAA ATTGATAATG CTTCATGGAC 
AATTACAATT CAAAAAGTTT AACGTTATCA TTTTAACAAT AAAATAGATA TTAGATTCTA 
TGGCTACTTC CGCTAATTTA AAAGTGAGTA AGTAGTCTTT TTTTTTTTAG TTCATGAAAT 
CATTTTTATA TAGTGTGGCA CATTTTATTC CAAAAGATGT AATAAAACTT AACGCATTTT 
TGCTTTTTAT AAATTGTCAG ATTATTATGA AAAAAAGGGA GTGGTAAGTA TGAATCTTAA 
CGATACGATA TTTATGTTTT TGTGTACATT ATTAGTTTGG TTAATGACAC CAGGATTAAG 
TTTATTTTAT GGTGGGTTAG TTCAATCTAA AAATGCGCTT AATACTGTCA TGCAAAGTAT 
GGCAGCAATT GTGCTTGTTA CATTTGTATG GATAACAGTT GGTTTTACAA TTAGTTTTGG 
GAATGGGAAT TTATGGTTCG GAAATTGGGA ATATACTTTT CTTAATCATG TAGGTTTTGC 
GACTC7VAGAA GATATTAGCC CACATATTCC TTTCGCTTTG TTTATGTTAT TTCAAATGAT 
GTTTTGTACG ATTGCAATTT CTATTTTATC TGGTTCAATC GCTGAGAAAA TGAAGTTTAT 
TCCTTATTTA TTATTCGTAG TAATATGGAC TGCTCTTGTA TACAGTCCAG TAGCACATTG 
GGTTTGGGGC GGCGGTTGGA TTAACAAACT CGGTGTATTA GATTTCGCTG GAGGTACGGT 
TGTTCATATT ACATCAGGTG TTTCTGGTTT AGTATTAGCT ATTATGATTG GAAAAGGAAA 
CAAACATTCT GAATCAACAC CACATAATCT TATCAl-rACG TTGAITUGCG GTATATTCGT 
GTGGATTGGT TGGTATGGAT TTAATGTAGG TAGTGCTTTT ACATTTGATA ATATTGCGAT 
GCTTGCATTT ACAAATACTG TCATTTCAGC CAGTGCAGGT GCTATAGGTT GGTTAATTTT 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 
AAITGTOGGG GGACTCTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 
CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT 
TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 
ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 
ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 
TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 
GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 
TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTCT 



3120 
3180 
3240 
3300 



EP 0 786 519 A2 

AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGAITA TTTCCATGCT TTAGATCGAT 
TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 
TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTCCTrr AGAAGATGTT GATCAGT-PTA 
TAGATGATCT TAAAAAATAT GACAAAATCG AAAITCTAGG 7TTAATGACG ATGGCACCAT 
TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GrTACGrTTG AAAAAAGAAG 336o 
AAATACAACG ACTCAATTTA GAATATGCGC CTTGTCATGA ATTATCAATG GGAATGAGTA 3420 
ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTCGG ACTAAACTTG 
TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 
AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 
AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 
20 TGCATCAAGA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 

AAATAATTCA CGTAATGTTG TAACTATGAA CAA1GCTACA CCAAACAATG CATCACAAGA 
AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTGCTGA 
TGAGCTTAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 
GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 
(2) INFORMATION FOR SEQ ID NO: 343: 



3480 
3540 
3600 
3660 

-i ^ /-V 
•J / ^ U 

3780 
3840 
3900 
3931 



60 
120 
180 
240 
300 
360 
420 
480 



55 



JSOOaO: <EP 07e6519A2.l. 



1220 



10 



15 



20 



40 



EP 0 786 519 A2 

GTATTGGTGT TTCTTCTGGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 1320 

CTCCATTACT TGAAACATCT ATCGTTGGTG CACAAGGTGT GCTTATGAAT ATTACTGGTG 1380 

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 1440 

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GATGAGATTG 1500 

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 15S0 

GTAGCACTGG ATTCGGAACA AGCGTAAATA CTTCTAGCAA TGCAACTTCT AAAGATGAAT 1620 

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 1680 

CAACTAAAGA AGATGATATT CCTAGCTTCA TTAGAAATAG AGAAGAAAGA CGTTCAAGAA 1740 

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT TCAACACAAA TCATCAGATA 1800 

ACATATCTGA TGATTTTTTT ACTAATTTTT AGaACATGTA GAAGGACATT TAAGTTTTTC 1860 

aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 1920 

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTTGTAC AGTTAGACGA 1980 

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 204 0 

2S CATCMTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100 

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160 

CGCCCATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTIGATAGA 2220 

^° AAAAATTGGG TGTTTCCCAT TCAAACACAT GPJiPATPJ^G TCGCTTGTAT TACAAAGGAT 22 80 

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 2340 

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATGCAG ACTGTGTACC AGTATATTTT 2400 

''^ TATAGTACAA AACATCATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 2460 

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2520 

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 2580 

ACATTGCCAA TTGATAGTGC CAACTATATT GAAACTAGAG GACGAGATCG TCATGGTATT 2640 

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 2700 

4S TATACGACAG CGTATGCTAC ATCTGAACAT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 2760 

AAAGGTCAAA CAGGACGCAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2820 

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2880 

^° AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2940 

GAGCTAAAGA AGCGTATCAG GCIGGAATAA GACATTTTGG TGAGAATAGA TTGGAAGGCT 3000 
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CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA QPJATTGMkA TATACAATCA 
TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TGTTATTCAT 
TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 
(2) INFORMATION FOR SEQ ID NO: 34 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 931 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 
TTGAGTGACT TTATTGAAGC GCGTGTAGAA GAAATATTCT TCGAAGTATT TGATGTTTTA 
CAAGATTTAG GATTAACAAA AGTAAATGGT GGGTTTATTG TAACTGGTGG ATCTGCAAAC 
TTACTTGGCG TAAAAGAATT ATTATCAGAT ATGGTAAGTG AAAAAGTTAG AATTCACACG 
CCATCACAAA TGGGAATTAG AAAACCTGAA TTTTCTTCAG CAATTTCTAC AATTTCTAGT 
AGTATCGCTT TTGATGAGTT ATTAGATTAT GTTACAATTA ATTATCATGa TAATGAAGAA 
ACTGAAGAAG ATGTTATTGA TGTGAAAGAC AAAGATAACG AATCTAAATT AGGCGGaTTT 
GaTTGGTTTA AACGTAAAAC AAACAAAAAA GATACTCATG aAAATGAAGT AGAGTCAACA 
GATGAAGAAA TTTATCAATC AGAAGATAAT CATCAGGAAC ATAAACAGAA TCATGaACAT 
GTTCAAGACA AAGATAAAGA TAAAGAAGAA AGTAAATTCA AAAAACTAAT GAAATCTCTA 
TTTGAATGAT TATTGGCCAA TAAAACTAGG AGGAAATTTA AATGTTAGAA TTTGAACAAG 



1218 

SNSOdCtO: <EP 078651 9A2_L> 



4440 

4500 
4557 



660 
720 



GATTTAATCA TTTAGCGACT TTAAAGGTCA TTGGTGTAGG TGGTGGCGGT AACAACGCCG 
TAAACCGAAT GATTGACCAC GGAATGAATA ATGTTGAATT TATCGCTATC AACACAGACG 
40 GTCAAGCTTT AAACTTATCT AAAGCTGAAT CTAAAATCCA AATCGGTGAA PJ^AlTTMiCAC 780 

GTGGTTTAGG AGCAGGAGCT AATCCTGAAA TCGGTAAAAA AGCTGCAGAG GAATCTCGTG 840 
AACAAATTGA AGATGCAATC CAAGGTGCAG ACATGGTATT TGTTACTTCT GGTATGGGTG 
GCGGAACTGG TACTGGTGCA GCACCAGTCG TTGCTAAAAT TGCAAAAGAA ATGGGCGCAT 
TAACTGTTGG TGTTGTAACT CGTCCATTTA GTTTTGAAGG ACGTAAACGT CAAACTCAAG 1020 
CTGCTGCTGG AGTAGAAGCT ATGAAAGCTG CAGTAGATAC ATTAATCGTT ATACCAAATG 1080 
ACCGTTTATT AGATATCGTT GACAAATCTA CGCCAATGAT GGAAGCATTT AAAGAAGCTG 
ACAACGTGTT ACGCCAAGGT GTACAAGGTA TCTCAGACTT AATCGCTGTT TCTGGTGAAG 
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960 
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GGATTAGATT ATGAAGGTGT TACACTACAA AAAATTAAAT TAACTGAACC AGCAATGATT 264 0 

TCATCAGAAA ATCCGAGAGA AATTGCGCAT ATTGCTGAAA TTATGATGAA AGAAATTGAC 2700 

ATATTAAATG AAAAATATGC GATTTGTATC GCAGACTCAA GTGGAGAGTT TAAAGCTTAT 276 0 

AGGCATCAAG TGGCTAACTT TGCCGAAGAA AGAGAAGACA TTAAAGCGAT TCATCAACTA 282 0 

ATGATTGAAG ACTTAAAGCA AAGAGAAATG GACGGCCCAT TTGAAAAAGA TTCACTTTAT 2880 

ATTATCAATG ATTTTAAAAC ATTTATTGAT TGCACGTATA TTCCGGAAGA TGATGTTAAA 294 0 

AAGCTTATTA CAAAAGGACC AGAACTTGGC TTGAACATTT TATTTGTCGG CATTCATAAA 3000 

GAATTAAtAG ATGCTTATGA TAAACAGATT GATGTTGCAC GTAAAATGAT TAACCAATTT 3060 

AGTATAGGTA TTCGTATTTC AGACCAACAA TTCTTTAAAT TTAGATTTAT TCAACGAGAA 3120 

CCTGTTATTA AAGAAAATGA AGCATATATG GTCGCAAACC AAGCTTATCA AAAGATTAGA 3180 

TGGTTTAAAT AGCAATGAAT TAAATAGGAG GGAGGTATGT TATGAATTTT AATGATATTG 3240 

AAACAATGGT TAAGTCGAAA TTTAAAGATA TTAAAAAGCA TGCTGAAGAG ATTGCGCATG 3300 

AAATTGAAGT TCGTTCTGGA TATTTAAGAA AAGCTGAACA ATATAAGCGA TTAGAATTTA 3360 

ATTTGAGTTT TGCACTAGAT GATATTGAAA GCACAGCAAA GGACGTACAA ACTGCAAAAT 3420 

CTAGTGCTAA TAAGGACAGT GTAACTGTTA AGGGAAAGGC GCCCAATACG TTATATATTG 34 80 

AAAAAAGAAA TTTGATGAAA CAAAAGCTTG AAATGTTGGG TGAAGATATC GATAAAAATA 354 0 

AAGAATCCCT CCAAAAAGCT AAGGAAATTG CTGGCGAAAA GGCAAGTGAA TATTTTAATA 3600 

AAGCAATGAA TTAATATTGA GGTGAAGATA TGGGTGGATA TAAAGGTATT AAAGCAGATG 36 60 

GTGGCAAGGT TGATCAAGCG AAACAATTAG CGGCATVAAAC AGCTAAAGAT ATTGAAGCAT 3720 

GTCAAAAGGA AACGCAACAG CTCGCTGAGT ATATCGAAGG TAGTGATTGG GAAGGACAGT 3780 

TCGCCAATAA GGTGAAAGAT GTGTTACTCA TTATGGCAAA GTTTCAAGAA GAATTAGTAC 3840 

AACCGATGGC TGACCATCAA AAAGCAATTG ATAACTTAAG TCAAAATCTA GCGAAATACG 3900 

ATACATTATC AATTAAGCAA GGGCTTGATA GGGTGAACCC ATGATGAAAG ATGTTAAGCG 3960 

AATAGATTAT TTTTCTTACG AAGAATTAAC AATTTTAGGT GGTAGTAAAT TGCCTCTCGT 4020 

AAATTTTGAA TTGTTTGATC CATCAAATTT TGAAGAAGCT AAAGCTGCTT TAATTGAAAA 4080 

GGAATTAGTA ACAGAGAATG ACAAGTTAAC TGATGCAGGT TTTAAAGTGG CTACATTAGT 414 0 

CAGAGAGTAT ATTAGCGCCA TTGTAAATAT TCGAATTAAT GATATGTATT TTGCACCATT 4200 

TAGCTATGAA AAAGATGAAT ATATTTTGTT AAGCCGGTTT AAAAATAATG GGTTTCAAAT 4 260 

ACGAATTATC AATAAAGACA TTGCATGGTG GTCGATTGTA CAATCATATC CTTTATTGAT 4 320 
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AGGGAAATCT GAGATTATCC AATCATACAT TTTATCrTTA GCTATTAATT TTCACCCTCA 
TGAAGTTCCA TTCCTATTGA TTGACTATAA AGGTGGGGGT ATGGCGAACT TATTTAAAGA 
TTTAGTCCAT TTAGTTGGTA CGATTACAAA CTTAGATCGC GATGAAGCGA TGCGTGCCTT 
AACATCAATC AAAGCCGAAT TGAGAAAACG TCAACGTITA TTCGGAGAGC ATGATGTTAA 
CCATATTAAT CAATACCATA AGTTATTTAA AGAAGGTATT GCGACAGAAC CAATGCCACA 1080 
TTTATTCATT ATTTCCGATG AGTTTGCCGA ATTAAAATCA GAACAACCTG ATT^TATGAA 1140 
AGAACTTGTA TCAACGGCAC GTATTGGACG TTCGTTAGGT ATTCATTTAA TACTTGCGAC 120O 
ACAAAAACCA TCGGGTGTTG TTGaTGACCA AATTTGGTCT AACTCTAAAT ITAAGTTGGC 12S0 
ATTAAAAGTA CAAGATAGAC AAGACAGTAA TCAAATTTTA AAAACACCAG ATGCAGCAGA 1320 
CATTACmTTA CCaGgTCGTG CGTATTTACA AGTTGGTAaT AATGAmATTT ATGAATTATt 1380 
CCAATCTGCA TGGAGTGGTG CAACATATGA CATCGAAGGC GATAAATTAG AAGTTGAAGA 1440 
TAAGACGATT TACATGATTA ATGACTATGG TCAACTTCAA GCAATCAACA AAGACTTGAG 1500 
TGGACTTGAA GATGAAGAAA CGAAAGAAAA TCAAACTGAG TTAGAAGCGG TCATAGATCA isgQ 
TATCGAATCT ATTACAACAC GATTAGAAAT CGAAGAAGTT AAGCX3TCCAT GGCTACCACC 1620 
ATTGCCAGAA AATGTATATC ArGAAGATTT AGTAGAAACa GATTTCAGAA AATTATGGTC I68O 
AGATGATGCA AAAGAAGTGG AATTAACATT AGGACTTAAA GACGTACCAG AAGAACAATA 1740 
TCAAGGACCG ATGGTATTGC AATTGAAAAA AGCTGGGCAC ATCGCGITAA TCGGAAGTCC 1800 
AGGATATGGT AGAACAACGT TCTTACACAA CATTATTTTC GATGTTGCAA GACACCATCG i860 
TCCTGATCAA GCACACATGT ACTTGTTCGA TTTCGGTACC AATGGTTTGA TGCCAGTTAC 1920 
AGACATACCA CATGTCGCTG ATTACTTTAC AGTAGATCAA GAAGACAAGA TTCCTAAGGC 
GATACGTATA TTTAATGATG AAATTGATCG TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 
CACTAGTATT TCTGAATATC GAAAATTAAC TGGTGAAACA ATTCCGCATG TCTTTATTCT 
TATTGATAAC TTTGACGCAG TAAAAGATTC ACCTTTCCAA GAAGTTTTTG AAAATATGAT 
GATTAAAATG ACGCGTGAAG GGCTAGCATT AGACATGCAA GTAACCTTAA CTGCTTCAAG 
AGCTAACGCT ATGAAAACAC CAATGTACAT TAATATCAAA ACGCGTATCG CAATGTTTTT 
ATATGATAAA TCAGAGGTGT CGAACGTAGT AGGACAGCAA AAATTTGCGG TTAAAGATGT 
TGTGGGTCGA GCATTGTTAA GTAGTGATGA CAACGTATCA TTCCATATTG GCCAACCATT 
TAAACATGAT GAGACCAAAT CATATAATGA TCAAATTAAT GATCAAGTAT CGGCGATGAC 
AGAATTTTAT AAAGGTGAAc ACCAAATGAT ATtCCTATGA TGCCAGATGA AATTAAATAT 
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TCTTGTAAAA CTGTACCAAC AACATTAGAT CTATCATGTA AACAACTAAC GGTTGCATCT 780 

TGATTATTTA TATATAGTTC CCCAGTTATG TTACCTTTAG TTTTAAATGG AATTAATCCG 840 

TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACCCGAAG CACCAACTAC TAATACTTTT 900 

TCTCCTGGAT AAATATCAAC ATTTATATTC TGTAATGTAG GTGTTGCTTG ACTATGATAT 960 

TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA TGATATATCA TTACCTTTCT 1020 

ATATTCATTT ACATATCTGA TTCAACAAAA TAACTATTCC TTACGTAAAC TACCTTTTTT 1080 

AATTTGAGAT GAaGCATATG CTTTTAATAA TATTGTCCCA ATAATGCCAA CTGAAATAAT 114 0 

ATTTAATACT GCAGAGATAA CACCTTGTGT ATAAACCTTG TTAGCTGGTT CGTTATAAAT 1200 

CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA ATATTCGCAA TAATTTGACC 1260 

GATATTAAAA TAAACCATCG ATTTCCTAGA AAATCGGCCT GAAGAAAGAT TTAATTTTAG 1320 

TCCAATCCAG CCATATAAAC AGCCTATAAT TCCCGAGC 1358 

(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4557 base pairs 
<B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGTTAT CATGATTTAG 60 

AGTTTCTATT TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA CGTTGGTTGC 120 

CACATATGAC ATTGAGAnGG CAAAACATTA GAGGATTTGT TTACAATCAA CGAACGOGTG 180 

ACCAAATTTT AACGTCAATT TATAGCATGA TTAAAGAACG TATCCAAGCT GTGCtGaACG 24 0 

CAGCAGAAGT AATGAGCAAA TTATTTTCAC ACCGCAATTA GTGTTTGTCA TTACAGATAT 3 00 

GTCATTAATT ATTGATCATG TCATTTTAGA ATATGTAAAC CAAGATTTAT CAGAATATGG 360 

TATTTCATTA ATCTTTGTTG AAGATGtGAT TGaAAGTTTG CCAGAGCATG TAGATACCAT 420 

45 TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG AATTAGTTCA 480 

ATTGAAATTT ACACCTGAAA ATATTGrTAA CGTCGATAAA GAATATATCG CGCGACGTTT 54 0 

GGCGAATTTG ATACACGTCG AACATTTGAA AAATGCAATT CCTGATAGTA TTACATTTTT 600 

AGAGATGTAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT GGAGACAAAA 660 

CGAAACATAC AAAACGATGG CAGTACCTTT AGGTGTAAGA GGTAAAGATG ATATTTTATC 720 
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ATTTCATTTT CAGCTGATAA 


AGCACCATGA TATrpATAnn 


AC» CGCATACC 


TTTAAGAAAG 


lOBO 


ATTGTGTCTT GCATTTTCAT 


TCTCCTTTAA AAAATCTATA 


1 IACjCTTA 


ATTTAGCATT 


1140 


CAACTCGACA TTATGAACGC 


GTACTGCTCT AACGCCTTTC 


M 1 AA i. ACCZAT 


ATGCAGTCGT 


1200 


AGCTGCAGTT ACTTCATCTC 


TTTCAACCGG TGTTGTATCA 


T A A r^/^/^R T'i^TV 

X Ajft.i« C \--ATCA 


TCTCTTTAGT 


1260 


GAAACGTTTC CGGCTTGTCG 


CTAATAAAAC TGGATATTCT 


\3 X. lV9\..AAw\A 


GTTCATCCAG 


1320 


TCTTGCCATA ACTTCGGCTT 


CTTCATTTCT AGTTTTAGCG 




C i GGATCTAG 


1380 


CCAAATTTTA TTTGAAGGTA 


TACCAGCTAT TTTAGCTTGA 




CTAACAAAGA 


1440 


TGTTAACATT TCTTCGACAA 


CCGGTTCATC ACRATTArra 


X x^rcCATTAT 


GCATTAAAAT 


1500 


AATTTCCGCG TCATAtTTAG 


CTACAATTTG GAACATArrrA 


1 vaATCATACA 


GACCgCCcAT 


1560 


tGATCATTAA TCATATCAAC 


GGCTAATTTT AAACATGCTT 


CAGCAACCTC 


ACTTCGAAAT 


1620 


GTATCGACTG AAATTTTTTA 


CATCAnAACC GACAATAGCT 


TCAnCAACAG 


TAATACTCTG 


1680 


TTCCATCTCT TCTG 








1694 


(2) INFORMATION FOR SEQ ID NO: 34 0: 









. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


340 : 




35 


AGCATTTCTT 


TTCTATAAAC 


ATTTAATTGA 


ACATTATTAA 


GTACACTATT ACTATAGTCA 


60 




CTATATTGAA 


CaCATACCTC 


ATTTAATTCT AATAGCGGTT 


CAGATTTGTA CTTATTATCA 


120 




TTATTTGCAG 


ATGTTTCATC 


TATCCATTTT 


TTCACTTTAA 


ATTTAACATG TTCACTCATA 


180 


40 


CAAACGTCAC 


GTAAATTCGC 


TAAGTTATCA ATGGATTCGA 


CATCTACTTC TGCATATTTA 


240 




AGCGCTGTAC AGTATAATGG 


TTCACGTATG 


CCTGCTTCTT 


TAAGCTTAGA TGATTTTAGC 


300 




AAATCACTAG 


GCGTTGTATT 


AGCGATGATT 


TTTCCATCTT 


TAAAAAGAAG AACTCTATCA 


360 


45 


AACGTATCAT 


CTAATGATTC 


TTCTAATCGA 


TGTTCGACAA 


TAATCATCGT TQACTTTGTT 


420 




TCTTCATGAA 


TATTGTTTAA 


CAATCTCAGC 


GTTTCATGTC 


CTGTCGCAGG ATCTAAATTG 


480 


50 


GCCAGTGGCT 


CATCCAATAT 


TAAAATAGGC 


GTACGATGGA 


TTAATATACC ACCTAATGAA 


540 


ACGCGTTGTT 


TTTGACCTCC 


AGATAAATCT 


TGCGGTCGGT 


GATTTAAATG TTCTATCATG 


600 




CCAACl-rriT 


CAGCCCAATA 


ACTTACATTT 


TTCTTCATAT 


CATCTTGTTC AACACAATTA 


660 
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TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 2820 

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 2880 

CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2940 

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT TAAAATCATG 3000 

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3060 

TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 3120 

tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 3173 
(2) INFORMATION FOR SEQ ID NO: 33 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 
CGATTATCCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 60 

CAAAAGCCTA AATAAATTAC AGCAATTTAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120 

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAAA TGCATTTAGA 180 

GATAATCaAA ATGCGGAAAC ATCTCCaATA ATCAATAATC TATTCCCAAT AAATATGAAT 24 0 

GTTCTCAACA ATACATTATT TATATCTCTT TACACTGTCA TCGACAAAAA CTAAATCTTT 300 

CACTTTCAAT TTCGAACGTG GTTCTACGAC ATTTGCTGCT ATATCATTTA ATGGGATTAA 360 

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AATCTATCAT 420 

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 480 

40 AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTGAG 54 0 

TGTTGTTTSA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 600 

TGCTGrrrCA TAAATCGGAG AAATAtTAGA TACGTTAATA CCATCATATT CATTCaAAAT 660 

CTTyATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 720 

TGCTTGAATC ATyTATTCTC CCTCACTATT TCGATACCTA CTCCATCATA ATGACCCGGA 780 
ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA TTACACGATT ATATTGTGAA 84 0 

TTTATACGAT TTGCAATACG TTCAGCTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 900 
ATAATTGATT TAACCTCTTC GAACACTTCA CCATAATGAA CTGTATCAAT AACATTATCA 960 
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ATTTTGTCAA ACTCACAAAT CGTTTCAGCA CCACTAATAT GAACATCTTG ATTCCTAGAA 
CGCATTTTTA TACTATACAT -GACGATCACC TcAATCTTCT TGaTGCAAAA TTTCAAACAA 
CCTATCTATA TCTTGTTCAG TATGAAAATA CGACAATGAT ATTCTTAACA TTCGCTTAGT 
CACAGTtGGA TACCTTAAAT AACTTGTAAA CACATGATGC TITAATAATG TrTCATCAAT 
GTTCTCAGCC GCTTCTATGT CATCAAACTC AATAAACTTA ATCGGCGAGT TTCaACTATT 
ATAATkAACA TTGAGTGCTT TTAACTTTTG GTTAAAATAT TTACTCAAAC TAITTAATTT 
AGTGCGTCTA TCATCAGCAT TTATTAACTT TTCAATGTTT CTTTTTATAA AATACAAATT 
ATAAATTGGC AAACTACTTG AGTAGATGAG TCGTCTACCG TGATTAATTA aCATATCCtT 
CaCATCaXTT GaACTkaAAA TcACACCCCC GTATGCACCA CATGCTTTAG ATAAACTAGA 
AGTGAGTATA TCTACACCTT GATAATTCGA GTAAtTCTCT ATtCCAAAAC TATGTCAAAC 
,0 ATCGAGTATC AGTGTTGCGT TAnATTTATG CTTTAATGAG ACTAATTCAC CAATATCCAC 
AACGTCGCCA TTCGTTGAAA ATACACTATC AGATATGATT ATTTTTCGTA TATTTTCATT 
AGGGTATTTC TCTAACCTTT TTTCTAAATC AGCAATATCT AAATGCTTAT ATATCACTTT 
TTCTAAACCA CTTAACTTAA TACCGTCAAT AATACTCGCA TGATTTTCTT GATCTGAAAA 
CACGACACAA TTTGTATTTT TGAAAATATT AAATAACGCC AAATTAGCAT CATAACCACT 



45 



1680 
1740 
1800 
1860 

ArrTAAGATA GTACATGcAC TATATCCGAG CCAACCTGCT AACATTGTTT CAATrTCTTC 



^^^w^^w^x wv^iioiii UAATTTCTTC 1920 

ATAAGCTGTC GAACTTCCAC TAATTAATCT TGAACTTGAT AAGTGATAAC TATACTTCCG 1980 

CATAAATCTT TCGAAATCAT CCTTATCAAA CGCTATTTGA CCTAATCCTA AATAATCATT 2040 
AGATGTATAG TTCGTACATC TCTTATTTTC TACITCAATA TACTGTCTAT CTATATACCC 

TACCGATTTA AGCGACCGAT ATAACCCTTT CTGTTGTAAT AAATCAATTT GCTCITCAAA 2X60 

CTTCATTCTT GTTTTCCTTA TTTTCACAAG TGTCATAATC AATTTCAAAG CCTAAATCAT 2220 

^ TAATCATATC GTAGTCTAAT TGGTTCGGTT GCCCACCAGT AATTAGATAA TnCACCGACA 2280 

AATATTGAAT TCGCCGCTTT TAATGCTAAT GGCTGTAACG AACGTAAGIT GACCTCTCTT 2340 

CCTCCAGCAA TACGAATTTC TTTCGTAGGA TTGATTAATC GGAATAATGC TACGATTCTT 2400 

AAACATTTCA TTGGTGTTAA ATCATCCATG CTTCCAAACT TrGTGCCTTT GATTOGATGC 2460 

AAAAAATTAA TCGGAATACT GTCGGCATCC ATITCITTTA AAGCAAATGC CATATCAACA 2520 

ATATCTTGAT TAGATTCTCC CATACCACAA ATC7.CGCCAG AACATGGTGA TATATTATTC 2580 
GCTTTCATTA GTTCTATCGT ATCTGTTCTA TCTTTATAAC TATGCGTTGT CACGACGTTA 
TCATGGTAAT TTTCACTTGT ATTAATATTG TGGTTATATC TGTCTACACC AGCTCACTTA 



2640 
2700 
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TTACAACAAA TTTTAACTCG TCCAGCTGAG CATGATGTTG TAGCAACTAT GAACTTGAAT 3060 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

^ GCAAACATTA ATTATGAAAC AGGTCATGCT ATTTTTGAAG CAACACATGG TACAGCTCCA 3180 

AAATATGCAG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 3240 

TTAGAACATT TAGGATCGCA AGAAGCGGCA GATAAGATTA CAGATTCAAT TGAAGATACA 3300 

ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 3360 

TCTACATCAG CATTTGCAGA TGAATTGATT GnAAATTTAA AATAAGCAGA ATAGAATTAG 3420 



15 



20 



30 



35 



40 



50 



G 

(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxxble 

(D) TOPOLOGY: linear 



3421 



25 , (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60 

TTTCCAATAA ATAAnCCAAC TGCAATnGCT ATaACTCTAA ATATAATAGA GATAATCmcA 120 

TTAATTGGAT TAAtACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 180 

AGTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 240 

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 300 

CAATX3ACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 360 

TAAACAACTT GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 420 

TCGTCCACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 4 80 

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGGAC 540 

ACGCAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACTAGCAAT TGCTAAGGCA 600 

•'^ TCAGTCACTC TTTCATTAAG ATCACCTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 660 

GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 720 

ATAGCTCCTG TATAAACCGT TTCATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 780 

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTATCATC CTCAATAATT 84 0 

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900 
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TACACATTTT GAGGAGTATG TTACAGATCA CGTGCATCCA ATGACAGCAT TACGTACGTC " I26i 

ATTATCATAT ATTGCACATT TCGATCCTGA TGCTGAAAAT GAATCAGATG AAAATCGTTA 1321 

TGAAAGAGCA ATGCGTATAC AGGCTAAAGT AGCATCATTA GTTACAGCGT TTGCTCGAGT 13 8( 

AAGACAAGAT AAAGAACCAC TTAAGCCTAA TCCTGACTTA AGTTATGCGG CAAACTTCCT 144, 

ATATATGTTA CGTGGGGAAT TACCAACAGA TATAGAAGTA GAAGCCTTCA ATAAAGCACT 150( 

TATTTTACAC GCTGATCATG AGTTGAACGC ATCTGCATTT ACGGCACGTT GTGCGGTATC 156C 

ATCATTGTCA GATATGTACT CAGGTATTGT AGCAGCCGTA GtTCTCTGAA AGGGCCATTA 162 C 

CATGGTGGTG CAAACGAACA AGTTATGACG ATGTTATCTG aGATTGGGTC AaTTGAAAAT 16 8 C 

GTTGATGCTT ACTTAGATGA AAAATTTGCT AATAAAGrTA AAGTAATGGG cTTCGGTCAT 174 c 

CGTGTATATA AAGATGGTGm tCCTAGaGCG AAaTATTTaA GaGAAaTGAG CCGTCAAaTT 180C 

20 mCGAAAGACG CTGGTCGTGA AGAATTATTT GAAaTGTCAG TGAAAaTGGA AAAmCGTATG 1860 

GCAGAAGAAA AAGGATTAAT TCCTAATGTT GATTTTTATA GTGCGAGTGT TTATCACTGT 1920 

ATCGPAATAC CTCATGACTT ATTCACGCCA ATCTTTGCTG TAAGTCGTTC TGCAGGATGG 1980 

ATTGCTCATA TTTTAGAACA ATATAAAGAT AATAGAATTA TGCGTCCTAG AGCGAAATAT 2040 

ATTGGCGAAA CGAATCGTAA GTATATCCCG CTTGrAGaAA GAAAmTAATC AATACAAATT 2100 

AAAAATGAAG ATGTAAAATT TGGAGGTAAA ATAACTATGA CTGCAGAAAA AATTACTCAA 2160 

GGAACTGAAG GATTAAACGT ACCTAATGAA CCAATTATCC CATTTATTAT CGGTGATGGA 2220 

ATTGGACCGG ATATTTGGAA GGCAGCAAGC CGAGTTATAG ATGCTGCtGT TGAGAAAsCC 2280 

TATAATGGCG AAAAACGCaT TGAATGGAAA GAAGTGCTAG CTGGCCAAAA AGCATTTGAT 2340 

ACAACTGGTG AATGGTTACC TCAAGAAACA CTTGATACAA TTAAAGAATA TTTAATTGCT 2400 

GTTAAAGGAC CTTTAACAAC ACCAATTGGT GGTGGTATTA GATCATTAAA TX5TGGCTTTA 2460 

40 OGCCAAGAAT TAGATTTATT TACTTGCTTA AGACCGGTAC GTTGGTTTAA AGGAGTACCA 2520 

TCACCTGTTA AACGTCCACA AGATGTTGAT ATGGTTATTT TCCGTGAAAA TACTGAAGAC 2580 

ATTTATGCTG GTATTGAATT TAAAGAAGGT ACAACAGAAG TTAAAAAGGT AATTGACTTC 2640 

TTACAAAACG AAATGGGTGC GACAAACATT CGATTCCCAG AAACTTCAGG TATTGGTATT 2700 

AAACCAGTTT CTAAAGAAGG AACTGAGCGA TTAGTTAGAG CAGCTATACA ATATGCTATC 2760 

GATAATAACC GTAAATCAGT TACTTTAGTT CATAAAGGTA ATATTATGAA ATTTACAGAA 2820 

GGCTCATTTA AGCAGTGGGG TTACGATTTA GCATTATCTG AATTTGGTGA TCAAGTATTC 2880 
ACTTGGCAAC AATATGACGA AATTGTTGAA AATGAAGGCA GAGATGCTGC TAATGCTGCT 
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TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCGGTCTG ATGGAAAATT ' 1200 

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAGCACGA CCGAAGTATC 1260 

5 ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 1320 

ATTTTTACG 1329 



(2) INFORMATION FOR SEQ ID NO: 337: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

IS 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

2^ GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60 

TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT 12 0 

GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATGATAG AGAACCAAAA 180 

25 TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT GACATTAATA ACAGTACACA 24 0 

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 3 00 

AGCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC 360 

AAATCCTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT TAAATTCAGT 420 

ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT 480 

GATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTTTGACCAG CACCTAAGAA 54 0 

35 

TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC 600 

CCTTTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 660 

40 TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 720 

CTTTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT TGTAAAATTC 780 

ATGGATTATT CACATCTTGA AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 840 

TTACATAGAT TTACAAAAAA ATTGTTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 900 

GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 960 

ATTtATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AGACTAAAAT 1020 

50 

AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCTGA 1080 

AAATGCGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 114 0 

55 
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ACCAACCGTA AACTTGTTrT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG 
CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 
CTTTTATAGC ATCTTTGCCA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 
ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 
CACCATATAT GAAAGCTCC 

(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

TATAGTTnTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 
CGTATTAAAT TGAATGGTAG GACCAATCAA AGTATATTCA ATCGCTGGAC CATCATTGTT 
AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 
AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 
AATCITAATT GACATTTTCA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 
TATCTCTTTC GATTTGTATA AATTTCTCTT CATCTATTGC ATAAAATTGT ATCCATTCTC 
CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 
CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGnTGA TTATTCGCAA 
TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 
GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 
AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA TGCTTTGCTA 
CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 
TGGTTTGATT CACAGCATGA ACATTTTTTT CATTAAATTT. GTTAAGTTCT AAATTTTCAA 
TTAATTTAGA AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 
GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 
CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AATCATAATT GTCTGCTCGT 
TAATAAATCT TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 
CCTTTTCATT GTATCATTGA GAAGCACAAG TTGTTTAATA AGTAATTCAA ATCGCATATA 



1200 
1260 
1320 
1380 
1399 



1080 



ss 
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CCGTTATAAC CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020 

GCTCTTTAAA CTATAGaTTA ATACTTAAGk TAAACTCATA CATACTGATA CCATACGTTA lOSO 

GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 114 0 

CCTTAAAAAC TTCACATGTG ATTGTkTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200 

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 1245 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

CTTGTAAAGG TAGTTGTTCA TTTAAATTAA AACAGTATGC TAAGTTnGTG CTTATATTTT 60 

GCAACTTCAA TTCGACAGGC TCATCTTGAT ACATGAATGC CTCAATTTCA TCATGTGATA 120 

ACrmTCAA AATATCAATT AAATGrATAC TAAAAACGAT AAATAACATA TCCCAATTTG 180 

AATTCAGCCC TAGCGATTTT AATTTGTTTA AAATCTTATC TTTTTGAAAA ATTCGATTCC 24 0 

TAATGTCTTG TATATCGTTA TCAGTCAAAG TTTCCCAATC TATATGTGAA TGAAGACCTA 300 

AATAACACTT ATCCATTAAT AATTCATATA CCGTTAATGC AGAGACATCG AAACAACGTT 360 

CTTCACTTAr AAAAACGCCA TCAACATCAA ATAAAATTTT CTTCACAATC CCCACTCCAT 4 20 

TTCTGaAAAT TCAGaTATAA ATCATTCTAC TATTTGACTA AAAAAAGCGC AAACCCTATT 480 

GAAGTaGATT TGCGCTTTAG CTGTTnAAAT TTTATAAATG TnTTTCAATT TCATCAGCAA 540 

CCTGCTGTAC GTGTGTACCG ACAATAACTT GAGTTGAATG TTTGCCATTA ACAGTAACAC 600 

40 CAACTGCACC GGCGTTTTTA ATCTTCTGTT TATCAATAAT AGATGTGTCT TTTAACTCTA 660 

GACGCAACCT TGTTGCACAA TTGGTTAAAT TAACAATATT CTCTTGACCG CCTAAACCTT 720 

CTAATATTTG TATAGCATGT TGATGATATT TACTTTGTTT AATATCATTT TCACCAGGAG 780 

CAATATTATC TTTTACAACT GTTGGGTCAA CTAATTCATT TTCACCTCTA CCAATCGTAT 84 0 

TCAAGTTAAA TACTTGGATT ACTACACGGA AAATCACATA GTATAAGATG AAAAATACAA 900 

CACCTTGAAC AAGCAACATC AATGGATGAT TTGATACTGG ATTAATTAGT GATAACACAT 960 

AATCTATCAA ACCTGCACTA AATGAAAATC CAGCTGTCCA ATGGAATGTA GCTGCGATAA 1020 

ATAAAGATAA TCCTGTTAAT AACGCATGAA CAACATATAA GATTGGCACA ACAAACATAA 10 80 
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CAAAGTTTGG CAAGGCATCT 


TGTACPTPAT 


1 lATAACTCT 


AAATCCcATC 


ATTTTCAGTA 


3640 


GCATACATTA GTGGATGCAC 


ATGAT"J"J"I7 T 

«• ^ NJrt ^ X X A V. J. 


'K IV n ^ ^^/"i ^^^^ 

AAA t TACG CG 


CTAACCGAAT 


TCTAGAAGAC 


3900 


ATAACAATTG GTGTTTCTTC 


ATTACTTTTC 


AlWUAi iTjOC 


TGATATTATC 


ATGAATATTA 


3960 


TGCGTCATCA TGTTGCACCT 


CACTCTCAf^r* 


X 1 1 X AIjIXjCT 


TTAATTTC7VT 


CTCTAACAAT 


4020 


GGCTGCTTCC TCAAAATrTT 


wjA 1 1 1 CAAT 


AAG'i*'l"l"l"l''rc 


AAATATTCAT 


TCTTTTCTTC 


4080 


GATTTTTCGC TTTAAAGCTA 


TCTTTriATG 


TGAAGAATGT 


GGTGTCTTTC 


CAACGTGCTC 


4140 


AAATTGTCCA CCTTGAACTC 


TGCGGACGAT 


ATCAATGATG 


TCATCTTTAA 


ATGTTGCATA 


4200 


ACAATTAGCA CACCCAAATT 


TACCAACATG 


TGCAATATC 






4239 


(2) INFORMATION FOR SEQ ID NO: 334: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 





(Xi) SEQUENCE DESCRIPTION: 


SEQ ID NO: 


334 : 








CAAAACTTTG ATTATGTCGT 


GTCAGTTATT 


AATGCATACA 


GTAATGCATC 


GTTGTTATTT 


60 


30 


TTTATTTATT TACTATTTAT 


TATGTACCAA ATCAGCCTTt 


CAGTAAAATA 


GGTCAAATTA 


120 




CTGATTTTCT AAATATAAAA 


TGCCTCCTAA 


TAACATACTA 


TTAGTACATC 


ATTAAGAGGC 


180 




TCTTGTGTTA TTTGCATACT 


AAGCGCTCAA 


ATTTAAATTT 


AAGATGAAGA 


TTCTTGCAAT 


240 


35 


AATATTTCTA TATTTGTCGA 


TATATCCAAT 


GGATCTTCCA 


TTGGCAAGAA 


TCGATTTACA 


300 




ACATTTCCTT GTCGATCGAT 


TATAAATTTT 


GTGAAATTCC 


ATTTTATTGG 


GGaCCCAAAG 


360 




ATTCCTGGTT GTTCaTTCTT 


TAAATGCGtA 


AATAACGGAT 


GTTCATCGTT 


CCCGTTCACA 


420 


40 


GATATTTTAG CTAACACTGG 


AAATGTTACA 


CCAAATTTCT 


CACGACTAAT 


TTTCAAGATT 


480 




TCTTCATTAG AACCTGGTTG 


TCGATTGTCA 


AAATTATTAT 


TCGGAAAACT 


CAACACTACA 


540 


45 


AACCCACGAT CCTTATATTT 


TTGAAATAAA 


GTCTCTAGTT 


TTTTCAATTG 


TTCGCTATAT 


600 




ATACATTCTG TTGCAGTATT 


AACAACTAAA 


ATCACTTTAC 


CTCTAAATGC 


TTCTAATTTA 


660 




TAAGTTAAGC CTTTATAATC 


ACTTACTTCG 


ATATCATACA 


CATTTCTATT 


ATTCATAAGA 


720 


SO 


CACCCCTACA CAGCCTTTTT 


TATATTGAAT 


ATGTTCTTTT 


TAGAATGTTC 


TGATAAAATA 


780 




AGTGCGCGTT TACACCGTGA 


ACACACATTA 


TATAGCGTGA 


TACATTTTTC 


GAGCACACGA 


840 




TAAATAATGT TCGAGTTTAT 


GTTGTTGCTC 


AACCTATCCG 


ATTTACCGTC 


TTTTTTCACC 


900 
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TTTTGCGATA TTCATCTAAT GTAGTAGCAC CAATACATTG TAATTCACXA CGTGCTAATG 2040 

CCGGCTTCAA AATATTCGAA GCATCGATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 2100 

TATGCAACTC ATCAATAAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 2160 

TTTTCAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 222 0 

CTAAAGACAT AACACGCTTA TCTTTTAATG TCTCTGGTAC CTCATTATTC ACTAtGGCtT 2280 

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 2340 

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 24 0 0 

TAACAGGATC TAATGTACCG TCTTTGGCAA TGACTGTTAA GTCACGAGCT AAACTATCTA 246 0 

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 2520 

CTAAAGCTTT CACAACTTGT GCACGTGCTT TAGTAATATT TAAATCTAGA TTTGCAAAAA 25 80 

20 CTCTTGCTGC AACACCTTCA TTTTCACGAA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2 64 0 

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 2700 

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 2760 

CTTCAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2820 

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TATATTTGAA TGATTTAAAC 2880 

GAATTGCTTC TTCTTGGGCA TGTGCTAATA cGCXJCTGTGC ACGCTCAGTT AATCTACCAA 294 0 

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3 000 

TTTACAGATT TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 3060 

TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120 

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180 

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 3240 

40 GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3300 

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGTCCATAA ACTTGCGAAC 3360 

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTG 3420 

TCATTCTTTT CATAATAGAT AGACCTGGCA AATGTAGCAT CACGCTTGCT CTCATACCAG 34 80 

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TTCATCATAA CTTATATCAA 354 0 

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 56 00 

SO 

TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 3660 

ATTCATCGTC ATTCACTAAT ACTGCGGCTG CTGGTTGTTT TATTAGTTCA GGACTAATCA 3720 

55 
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GGTATCTTTC AAAGATAAAA TCTTAATAAT TTCTTAGTAA ACTCTTTTCT CTAGATTTAt' 
CACAATATTA TATAGACCTA TnTATTT^ ACGTAAGT.X. CTAGTATCTT CAAACAAAAA 
ACCTTTATAA AAATTCATAC CTTTATGCTA TCGCTGTAGG CTCATTAACT TCTTACATAT 
AATTCTTAAC TATCCTTTGA TGATTGTTTT ATTAGATTGT rrCGTTGATG GATACTTTO. 
CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCCCAA AACTTTAATT TGTTTTCGAC 
1X.TTTTATAA TTATGCTTGC GATGGTGITT TAGriTCTGA AGTTTGTrCA GCAATGTCAT 
ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTC^rr TGATT"ACCAT 
CTAATAITAA TTCACTTAAA TTATCTTCGA TAGTTrrTTC TATCGCTCTA ATTAATGGTC 
TIGCACCATA TTCTGGATCA TATCCTTCTT CTCCGATTTT GTCTTTCGCT TTATCAGOTA 
CAATAATATT TATGTTTTGT TCAGATAATC GATTTGTTAA rTTATTAACC ATCATTGTTA 
CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAr A^^^r.^x^-... ^r^.r, 

-.*«*v-*i XV«Xrt\_^\„Vjrt^i 

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 
CATAATCTTG TCCATCACTT GAACCACCGA ATCCAGCAAA TCGTTGATCT TGTAATTCTr 
GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 
TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTCGAT 
GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 
TTAATTGTCC ACCATCATCA TGACCAACAT ATCCTGGAGG AGCACCAACT AATCGGCTCA 
CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCGCATCA TCATCGCCAA 
ACATTGA^rrc AGCTAAAGCT CTAGCTAATT CAGTTTTACC AACACCAGTT GGTCCAAGGA 
AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TCCACGGGCA CGTCTAACCG 
CTTTACTGAT TGAATTAACA GCATCTTTIT GCCCAATAAC TCTCTCATGT AATGTATCTT 
CTAGACTAAG AAGTTTTTCA GATTCTGrrr CATEXSATTIT AGTTAATGGG ATACCTCTCC 
ATCCTGCAAT AACTTCAGCA ATATCTTCTT CIX^CAATGA AGTTCaCATG CCAl^TC 
CATTCTTCCA TTCATTTTTA GCrrCTTCAT ATroCTTTTC AAGTrrTGTT TCTTTATCAC 
GCAGgTTAGC AGCATTTTCA AACTCTTGAG CATCTACTGC GGCATCTTTT TCATTTTTAA 
CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 
TTACTTTAGA ACTTGCTTCA TCAATTAAAT CAATTGCTTT ATCTCGTAAG AAACGATCTG 
AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATTAATAC 
GATGGTGTGC TTCGTAACGA TCTCTTAATC CTTTTAAAAT AGCAACTGTA TCTACTACTG 



55 
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10 



25 



30 



TAGATATTTT 


AAGCTGATGT 


TGTAATGCTT 


CGTTAGGATA 


TAATGCCAAT 


AGATATAACG 


3960 


TAAAGTGTAA 


GACAATTATC 


GTCATAAACA 


CACCAACTAT 


CATTCCCATT 


GCACGACTGA 


4020 


AAATATGAAT 


GTTTTGATAC 


GCTATTATTT 


TATCAAAAGT 


TACGATAATT 


AGATATAAAA 


4080 


TGAACTTACA 


AAACAATGTA 


ATCATTAAAA 


AAGCTACAAT 


CGCTTCAAAT 


CGATTTTGTA 


4140 


GATGATTAAA 


ATGAAACGCA 


AAAGTTGTAT 


TAAATGCTGT 


TGTTTTAGGA 


TATGGAATAA 


4200 


ATACAATTAA 


TCTTTCTACA 


ATAGATTTGT 


AAAATTGACT 


GGCAATCCAC 


AATGATACAA 


4260 


nCGTTGCACT 


CAAATGTATC 


ATAGATAACC 


AAAAACCTCG 


TCTGAATCCA 


ACGATGACAA 


4320 


AATACACAAA 


GAAAATGATT 


ATGATAAAAT 


CAATGACCAT 


TTATTG CTCA 


CGCTGCTGCA 


4380 


ATTTGTGAAT 


TTGTTGTTTC 


AAACGTCGAT 


TTTTTTTCTT 


CTAGTAGTAC 


TTTTTCATGC 


4440 


ATAATATTCA 


CAGCAGTTAG 


TATTGCTTTT 


CTTTGAAGTAT 


CTAAACCTGC 


TGCTTTATAC 


4500 


CCTAATTCTT 


TTATTTTATC 


ATCAACTAAA 


TGTGCTACAT 


ATCGTATGTG 


CTCTGGGTTA 


4560 


TCTTCCCCAA 


CAATTGTAAA 


AAGCTGATCA 


TTAATTGATA 


CATTTACCTT 


GTTTTTAAAC 


4620 


TGTGTCATTT 


ATAATTTCTC 


CTGATCCTTT 


TTTTAAAATC 


TAAATTCACG 


TTATAAAATA 


4680 


TGACTGGATA 


GTTTGTCTGA 


ATTTGATACT 


AATATTGTTA 


TATTGCAATT 


ATGATAAAAC 


4740 


AACAACACAA 


TCTCTATAGA 


TGACTTAATG 


TTCTTTTTAT 


AATGAAATAA 


TGTAAAGAAT 


4800 


TTTCTATTCA 


ATACTTTATC 


ATGTTTAAAT 


TGTGTCACTA 


TAACATTTTC 


ATAAACATTA 


4860 


TACATGACCA 


CTATGTATTT 


TGTAAGTATC 


CGCAATTAAT 


TCTTTACAAC 


ATACATAAAT 


4920 


GTTTCTGACG 


TTATTATCAT 


TTATGATATG 


ATTATTTTTG 


CTAAAGACAA 


TGAAATTTTA 


4980 


TGAAAGGATT 


TACACAAATG 


GCGAATATCG 


TTTTTAAATT 


GTCGGATAAA 


GACATAACGA 


5040 


CATTAATGTC 


ACGCATTTCT 


TTTGATACTG 


AGAATTTACC 


TCAAGGnATG 


AnAGCACGTG 


5100 


CAAAGTATCC 


AAAATACAAC 


TGTAAATATT 


TACCAT 






5136 



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4239 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

GGCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 
ATGACAAATT AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 
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TCACGCGATA ATGCGTTAAT CACTCTATCT TTTGTtACAA ATCCTTTTTG CGCAGTTGtA 


2160 


5 


CGCCATAATT CATAAAATCT AAATGATTTG TATGATGCX3C ATCAGTGTTA ATAGTTAATr 


2220 




TCACATTTGG ATATTTACGA ACGATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 


2280 




TAATTTCTAA TACTGTATTC GTrXCTTCAG CTAATGCCAT TAATTGTTCA ATATTCGGIT 


2340 


10 


TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CGCACGTATG 


2400 




GATTGCGACA TGCATTAGCT AATCGTTCCA TAATTTGTTC TTCTGATTGG TTAAAGCTTT 


2460 




GATGAATAGC TCCAATTACA TAATCAAGTT GTGCTAAAAT TTCATCATCA TAATCCAGCG 


2520 


IS 


AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCA CTATATTCTT 


2580 




TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAAAGTCT TTCTACTTGT AAGCCATTAG 


2640 


20 


CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCGATAT 


2700 


TTGCTTCTAC CATGTCTCGA ATAGAaAACG CACCATCACT ATACGTTGTA TGCATATGAA 


2760 




TATCACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 


2820 


25 


CATCTTCTCG CATAGCAGGT GGTATAAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 


2880 




CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT TCCATATTCA CTTACTTTTr 


2940 




CATCACGTGC TTTAGCAAGT TGTCGAATTC ttatattatg TTcrrrrGAc ccagtaaaat 


3000 


30 


GCTGCAATGT ATGATAAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 


3060 




ft ^^^^ ft ^^^^ H ^^^^ ^^mm^ * « - , 

CATCATCATA CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCmTACIT 


3120 




TATTGGGAAT ATTTAATAAT TGCTGCTGCA CTGCTTTTGG GTTATCGGTA CTTATTATGA 


3180 


35 


AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 


3240 




CGATATAATT TAATGTATCT ATATAATCAa' TGATTTCTTG ATTAAGTCrr CTCATTTGAT 


3300 


40 


CAATTGGATA TCTATCTTTC TTAGCACCAA GTTGTTTCAC AGCTTCTAAT ATGTTTTCTT 


3360 




CCGTTTTCIT AGCAAATCCG CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 


3420 




GTGACGCTTT ATCAACAATA TTCAACTCTT TATATAGCTT AGCAATTTTC TTGCTTCCAA 


3480 


45 


GTCCTTGAAT TTTCAAAAGT GGAATAAQAC CTTCCGGAAC rrCTTCCTGT AATTGCTGTA 


3540 




AATACTGAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA 


3600 




ivauuiiiCAA CTCCGTtACA TCAGATATTT CATCTAATGG TCGTTCATCT AATTCAAGAC 


3660 


50 


TTTGAGCTGC TTTTCGaTAC GCTGaTATTT TAAAAGTATT TTCCCCTTrr AATTCCATAT 


3720 




AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGTCATAATA ACACTCCATA 


3780 


55 


AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 


3840 
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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTTT CATAATTTTG 360 

GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420 

TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 480 

TCATTTATTT CTmTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 54 0 

CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 600 

AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 660 

TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720 

CCAATTTTTC TAACATGATC TAAAATGCTC ATTGCTAATG CAGCACCTTC ACTCGGATCT 780 

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 84 0 

TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 900 

20 TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGGGA 96 0 

ATCAACAATC CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 1020 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 1080 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 114 0 

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 1200 

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 1260 

GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 1320 

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 1380 

TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCXSATA TTCAGCTTTG 1440 

ACAGGTATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CATCTGATAA TTTTTTCTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 1560 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAGATCA GTTAACACAG GTAATTGATT CATCTTGTCA 1680 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 1740 

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 1800 

CCX3CCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 1860 
AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 1920 
GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980 
GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 204 0 

ss 
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20 



GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTCAnTr' 
TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 
AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 
GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 
GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 
CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGGAGGT TTAATTATTC CGGTAGAAAT 
CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCGG CTCCaGTATT 
,S ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 
TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 
TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 
TGTAGAATAA TTTTACGAAT ATTAAATTTfi aTTarttTr:iiT T-r» ^r. ^ ^ 

... • »™-w-»y.ei«iVj ««Vjl_ftUAl>\A 

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 
TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 
GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 
TCAAGAAGCA GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 
ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTCCGTTT 
AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTCTAT TACCAAACGG 
AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 
35 AGCAGCAGGT GCTGACTATG T 

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 5136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : 1 inear 



4$ 



SO 



SS 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 
CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 
ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 
ACCAGCAACA ATTTTATCGT ATTTTTGCTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 
ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TCTTCTTTAA CATCTGCACC 



60 
120 
180 

240 
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OrVATCriKCQlK, TTCCTGGAGA 


AATGCTTGTT 


ACTTTAACGC 


CTGTCTTTGC 


CAACTCTTTT 


660 




TCTAATCCTT GAGTAATAGT 


GTGAACTGCT 


GCTTTCGTCG 


CACTATAAAT 


CGTACTACTT 


720 


5 


TTCGTTACTT CAAAGCCAGA 


AATAGATGcA 


ATGTTAATAA 


GATGGCCACT 


TGATTGTTCT 


780 




AACATAGTTG GTAATGCAGC 


CTGTGCCGTA 


TATAAAGTGC 


CTTTGATATT 


CACATCAATC 


840 


70 


ATACTATCCC ACTCATCTAC 


TTGATAATCA 


GTAATCTTAG 


ACGACAACAT 


TTGCCCCGCA 


900 


CTATTGATAA CAATATCCAA 


ACCACCGAAT 


GTTTGTTGTG 


CAATTTTTAT 


CAATTCATCG 


960 




ACTTCTTCTT TATTCGTTAC 


ATCTGTTGGC 


ACTACCTTCA 


CACTATCTTG 


TGACAATTGA 


1020 


15 


TTCGCTACGT TTTGTAATTT 


TTCTTTATTT 


CTACCTGCTA 


AGACAACTTT 


TGCCCCTTCT 


1080 




TCATGTAGTA ATGTTGCAAT 


TGCTTCTCCA 


ATACCACTAC 


CTGCACCTGT 


AACTACTGCT 


1140 




ACTTTATCTG TTAATACTGT 


CATAATGATC 


GACTCCTTTG 


ATTCTTTTTA 


TTTTTTCAGG 


1200 


20 


GTAAATCATA AATACATATT 


ACTTTTAAAA 


AGCGTATCAC 


AATTCATATA 


ACGGTCATAA 


1260 




TAACTCGCTT CATTTTCATA 


GATAAATTAC 


ATTACAAGCC 


ATTCGAAACA 


TACAATTAAT 


1320 


25 


CGTTGCTTAT ATTTTTTATT 


TTTAAAAATG 


TTGAAAAATC 


GTCACTTCTT 


TATTGTAAAA 


1380 


ACATTATATT AGTAATAAAG 


TTAATACTGT 


GnATTTaTCA 


TTCGATTGAA 


TGATTAGAGG 


1440 




GAGGAATAAA ACgTGACATA 


TCATGAGCGT 


GTTTTAGCAT 


TAAGAGCAGA 


AAGTAAAAGA 


1500 


30 


ACCGCATTTG ATTTTCGATT 


CGAAGATTTA 


TTTAGCAAAG 


AAGAATGGcT 


AAGTATGTCT 


1560 




CTTGCAGAAA GACAAAAAGC 


TGAAAAAGCA TTTCGACACG AgTTAAAAAT 


ATGGACGATG 


1620 




TAAGAATGCC CTTCTCAAGT 


GTCCATGACG 


CCCAAGTAAA 


ATTATATAAT 


GTTGTATATT 


1680 


35 


CTTATAACGG CaTTAAACGT 


AATTTTAAAC 


AAGTTGAAAA 


TGGAAGGATT 


CTAATATCAT 


1740 




TTCGTTTATA TATnGCAGAC 


CATGGATAGA 


ATTTTnTATG 


GTnAATCC 




1788 



(2) INFORMATION FOR SEQ ID NO: 331: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 
SO TAAGCCAAAT CCATTGTTAC CAGAAGAAGT TCGCTTCATC TTPiPAACPAA TGGGTCTTAA 

AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC 
ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 

55 



1199 
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TTTAAATTAA TTTTATGTAA TATAAATACT GCATTTCCAA ACTGO-TOCAC •rrrTA«3TAT " 
AACAGAATTA ACTACATTTA AGGAGATTGA TGAACCATGA AAAAGAAAAA AGGTTITCGT 
CnCGTATTA GTTTAATCGC CATCATGTTA ATTCTATGTA TTCTAITAGT AATCATCATC 
ATGACTGGCG GAAAGAAAGA TACATACTAT GGAATTATGA AAGATAATAC TACTAITCAA 
AAAATGATTA GTGAAAAAGA TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 
GATGTTAAAG TTAAAAAAGG TGATTTTGTA ATTGITTATA AATTAGCAGA TTCAGATAAA 
ATTGTTAAAG TTAAAAAAGT TGACCATGAC GATGTACCAC ATGGTITAAT GATGAAAAIT 
CATGACATGG GCAAAATGCA CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTr 
GCCATCAAGG TATATACGAG TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 
TACTGCTGCT TTrrATATTT GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTCGCC 

TgcgCTTTGC GCATGCATAA AAGCCCCTAA rAanr"rrt*r-/- n^r^.r^^^ 

— >^..ww * wfc^ A \jwvj\„ 1 u»wV»ui"r(JC»cC 

TGcgCTTTAG CGCATGCATA AAAGCCCCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 
CTGCGCT^A GCGCATGCAT AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 
CCTG.CGCTTT AnCATGGCCA TAAAAGC 
(2) INFORMATION FOR SEQ ID NO: 330: 

30 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



IS 



20 



35 



1800 

1860 
1920 
1980 
2040 
2100 



2187 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

CCnCCTTTTT AAACCTGGnG AAATGtmCAG tTTTGATGGt ATT<3GGtTCT TTAGTATTAT 60 

40 ^ ^ 

GCTTAAGATA GAGTCTAATA CX3CTCCTGTT GTTCTTTATA GTATATTCCT TrrTGTTrrr 120 

CTTTTTCGTC ATATTTCACT TTTAAATAAA TCACTGATGC AACTATACAT ATACATAAAA 180 

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 240 

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGAITAT AAACCATAAC 300 

ACTAACATCG TCGCCTTCAT TAACTTGATT GITAAAATCA GCAATTACTG AGAATTCTAC 360 

so AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 420 

GATAGCTATT ACAITAAATG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 4 80 

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 540 
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TAAAAAATCC TAGCTGTTAT TCAAAAATAT TAGTTTTTAA AA 3642 
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

TTATTGATAT TGAAAATTCA AAAACTOCAA CACATCTTAT CACTAAAAAC CAACACTATC 6 0 

ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GGTATATCAT 120 

CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT TAAATATACT AATGATGGTA 180 

AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 240 

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 300 

ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 360 

ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 420 

TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAGATTTC ATTTTCCATA 480 

ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 54 0 

30 

TTCAACATTT GCTTCCAATA AGCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 600 

AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 660 
3S CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 
TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 

TGAirrGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 840 

AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 900 

TATAATGATG GTCTTACTTT ATATACTAAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 960 

ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 1020 

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 1080 

TGGGAAACAC AAGACAAAAG CTTTTTCAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 1140 

AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 1200 

ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 1260 

TGTTAAATAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATTCAATT 1320 



720 
780 



55 
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35 



40 



SO 



55 



GTGACTAGGC CTTCCTATCA GACATATTCA CTCATCCACG TATCAITATC TGTACAGTGT 
GCTATCTCTT ATTTACCTAT TGGAACAACC ATAAACTCAT CCATAGTTTA CCTTTTATAA 
ATAGCAGTCC TCACTCATAC AATTTCTCAT AAAAATCACA ACGCTCCAAC GTATTTCCAA 
CTTACTTTCA CCTATTTTAA TTCATAAAAA CGACACrTTA AITGTCATTA TCCAATAATA 



I860 
1920 
1980 
2040 

GCAAGACGTT ATTATTGCAA TCTTTTTTAT AAAATAATAG AATCATAGTA TT^^CATTTA 



10 " -™^x«rtx«o -kmh-ATACjTA TTGTCATTTA 2100 

AAGATAAAGT AAGAACGTTT TTATmTCA GATTTTTTAA ATTATTATGA ATATCTAGTT 2160 
TTAGGAACGA AATTACATTG AAAAAACAAG TTATTATTTC GGGCCTCATG TTATTTTCAC 2220 
TATTTTTTCG AGCCGGAAAT TTAATATTCC CGCCCATGCT TGGCCATACA GCGGGTCAAA 2280 
ATATGTGGAT TGGTATGCTA GGCTTTGCCC TTACAGGCAT ATnACTCCCC TTTATTACTG 2340 
TTATTCTTGT TGCATTTTAT GATGAAGGTG TTGAAAGTCT AGGCAATCGT ATACATCCAT 2400 
GGTTCGGGTT TATTTTTGCT GTCGTGATTT ACATCTCTAT Cfir,Ar.r,.n-T-r 

CACGTGCTGC AAATGTCGCG TACGAAATTG GTACAAGACA CATTTTACCT GTGCATAACC 2520 
AATGGACTTT AATTATATTC GCAGCAATCT TTTTTCCCAT CGTTTACTGG ATTAGTTTAA 2580 
ATCCATCGAA AATCGTTGAT AATTTAGGTA AATTATTAAC ACCGTTATTA CTATTAATGG 2640 
TCGCTCTATT AAGTATTGCT GTCATTTTCA ACCCTCAATC TCCACTAAGT GCACCTAAGG 
ATAAATATAT AACACATCCT TTCATTTCAG GAAGITTGGA AGGCTATTTT ACAATGGATC 
TTGTTGCTGC GTTAGCTTTT TCCGTAGTCA TTGTCAATGG CrATAAGTTT AAAGGCCTCA 
CAGATCGCAT GAAAATTTTA AAATATGTCT GCTTTTCAGG TCTTATTGCA GCCATATTAC 
TTGGAATGAT TTACTTTGCA CTTGCATACG TTGGGGCATC AACAGCTCCA GGAAACTTTA 
AAGATCGTAC AGATATATTG ACGTACAACT CATTACGATT ATTTGGTTCG TTCGGTAACC 
TCGTATTTGG AATGACGGTT ATCCTTGCAT GCCTAACAAC ATCTATAGGA CTCGTcAATC 
tTGCGCCACA TTTACTAAGA AACACGTACC TAAGTTTTCT TATAAAATAT TCGCACTTAT 
TTTCtCTATC ATAGGGTTCT TATTTACAAC ACTTGGTTTA GAAATGATTT TAAAAATTGC 



TGTCCCATTA TTGACTTTAA TATATCCCGT GTCGATTGCA CTTGTACTCA TATCATTTGC 3240 



TAACATGTTT AGCACATTCA GATTCAGTTG GGCCTATCGA CTCGCAACTG TTATTACATT 
GATTATTTCA ATTTTACAAA TACTAAATAG TTTCAACTTA TTACACGGTC TTATTrTGAA 
ATCGTTTATG ATGTTACCTT TAGCAGATAT CGATTTAGCT TGGCTTGTAC CATTCATGCT 
CTTTGCTATT ATCGGTTTCA TAATCGATGT ATTTATACGC CGTCCGAAAC AAGCGACAAC 
TTAATAAATG CTCACTGCCT AGTAATGATT GACCCATCGT TACTAgGCTT TTtCATATGA 



2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



3300 
3360 
3420 
3480 
3540 
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ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT TTaAGcGtTT 60 

CaTCaGCTAA CaCtTCaTTa TCtTTAGtCG CTTTATGCCC tTCtATTAAA CtCGGaACtA 120 

AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA 180 

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 24 0 

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 300 

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 360 

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACACCAAT AATTAGTCCA AACTTCAATT 420 

TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 4 80 

TAAACATTGT TTATATTCTT AGATGCACGT ACGTCGTGTT GCGCTCTGTA ATGTTATACA 540 

TACACTTATC CTTCATTATA CCCGAACTTT TTATATTAAA AGCAAATTTA TGGAAAATGC 600 

AATTAATTGT CTATTATTTT TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAACGC 660 

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TATTATTTAA CTTCGCCTGT 720 

TTTAGGATCG AATTCCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 780 

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 840 
GATATATTTC TTACCGAAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC GAGATTCAGC 
TGTAGCTTCT TTTGGATCTG TAAAGTCTAA TAAATTATCT CCAAATAGCA TGACAAGTTT 

ATGATCCTTT TCAACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020 

TAAAATATCA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 1080 

CTTTAAATCT TTTTCTTTAT CTCTATCAGA AATATAGTAG ATATCGACAC CTTTTTTGTC 1140 

AGCATATTTC AAGAATTCTT TTOCGCCATA GACAGGTTTA GCTTTAGCAG cTTGTACCCA 1200 

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 1260 

ATTATCTAAA ACrGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 1320 

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 1380 

TTTTGCTTCA GCTGAATTTT GATACCAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 1440 
CGCCTTTTGT GATGCTGGTA TAGAAGCTTG TTGCGTTTGT TGAACTTCAG CAGAACTTTT 
GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTCAAACT GTAACCGCTA CCGATAATGA 
TGCTATTGCA ATATACITTG AAATTTTATT CATCTTATCA CCTCATGATT AATATTTAAA 

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA TACTATGAAA AGATATTTAA 1680 

TATTATTTCT TGgAAAAGCT AACAAATATG TGAACATTTC ATATAAGCAT GATTTAATGG 1740 

SB 
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TCCACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AGGAA^AAa' 
TTTGCCArrr TTTTCGCAAT CTGTACAAAT TTTTCATCAT AITCTCCAGC TAATATTAAT 
GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 
(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



2640 
2700 
2746 



60 
120 
180 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 
TAATGTTTAG TTTATTAACA GTAAGTTCGT ATATCAATGT TTAGTGCTCC CCAAAATTGA 

AGTTTGAATT TTAAAAGCAT CTTnTirrz^a-r T^r.r^rr^r^.r.^rr. 

^ ^ * * vj Art X X 1 X 1 1 A UAAATTCATT 

TTGATTATTT TTGATAATGA GCATTTTAAT AGTAATACAT GTTTATAGTG TGTAGTATAT 
GTCTATACTA GTAGTAACTA TATAGAGAAA GTAGGAATAA ACTATGTCAC AAGATGTAAA 240 
TGAATTAAGT AAGCAACCAA CGCCAGATAA AGCAGAAGAT AACGCAriTT TCCCATCACC 300 
ATATTCCCTT AGTCAATATA CAGCACCTAA AACAGATTTT GATGGTGTTG AACACAAAGG 
TGCCTATAAA GATGGTAAAT GGAAAGTATT GATGATTGCT GCTGAAGAGC GATATGTATT 
ATTGGAAAAT GGAAAAATGT TCTCTACGGG TAATCATCCT GTTGAAATGT TATTACCTTT 
ACATCATTTA ATGGAAGCAG GTTTK5ACGT TGATGITGCG ACATTATCTG GTTATCCAGT 
TAAATTAGAA TTATGGGCTA TGCCAACTGA AGACGAGGCA GTTATAAGTA CTTATAATAA 
ATTGAAAGAA AAATTAAAAC AGCCAAAAAA ATTAGCAGAT GTGATTAAAA ATGAATTAGG 
ACCTGATTCA GACTATTTAT CTGTCTTTAT CCCAGGCGGA CATGCTGCAG TTGTTGGTAT 
TTCTGAAAGT GAGGACGTTC AACAAACATT AGATTCGGCA TTAGACAATG ACCGCTTTAT 
AGTTACATTA TGTCATGGAC CAGCAGCACT ACTTTCAGCA GGGCTTAACA GAGAAAAATC 
TCCATTAGAA GGATACTCTG TTTGTCTCTt CCCTXSaCTCA TTAGATGAAG GTGCAAATAT 900 
(2) INFORMATION FOR SEQ ID NO: 328: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 64 2 base pair<^ 
so (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



360 
420 
480 
540 
600 
660 
720 
780 
840 
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AAATGATTGA ATACGGTAAA TCACAAGGCA GAGATATTCA TTATCAAGAA GGCGTACCAT 
CATATGAACA AGTTGATCAA CTTACAAAAG ATAATGTAGG AATTATGATC CTTGCACmAA 
GTGTATCTCA AAACCCTAAT GACCCACATT TAGGACATGC GCTAGCAGTT GTTGGTAATG 
CTAAAATTAA TGACCAAGAA AAACTTATTT ACTGGAATCC TTGGGATACA GAaTTATCAA 
TCCAAGATGC AGATTCAAGC CTATTACATT TATCATTCAA TCGTGATTAT AACTGGTATG 
GTTCAATGAT AGGTTACkAA AAAGTAATAT AGATATTGAT TAAAGGCAGG TAAAACTATG 
TATCAACTAC AATTTATAAA TTTAGTTTAC GAC7VCAACCA AACTCACACA TCTAGAACAA 
ACCAATATCA ATTTATTCAT TGGTAATTGG AGTAATCATC AATTACAAAA ATCAATTTGT 
ATACGTCATG GCX3ATGATAC AAGTCACAAT CAATATCATA TTCTTTTTAT AGATACGGCA 
CATCAACGCA TTAAATTTTC ATCTATTGAT AATGAAGAAA TCATTTATAT TCTTGATTAT 
GATGATACAC AGCATATCCT CATGCAAACG TCATCCAAAC AAGGTATTGG CACTTCGCGA 
CCAATCGTTT ATGAGCGCTT AGTATAACTA ATTTAAATGA TTTCACTTCA TAAAGCGGGT 
TGGCGAGAAT TCAATTTCTC ACCAGCTCGT TTTTTCATTG TAATAATAAT CTTTAACATT 
TATTCTTTCT CTATTAATTT TTCTCAAACT ATCTTATCTT TATGATAATT AATTAAAATG 
CCCTTTTAAA TTCTTATAAA ATAAAAAAGC CACCTATCGT CGCTAATAAA CGACGCAAGT 
GACTTAATAT CATATTCAAA ATAACTTATG GGAATTTAGG GAATTGATCG AAGTCAGGAT 
CACGTTTTTC TTTAAACGCA TCACGGCCTT CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 
ATGTTGCATC CCCAGCCATT TGTTGTAAAC CAGCTAAACC ATCTGTGTCA GCATTCATAG 
CTGCTTTAAG GAATCX5TAAC GCTGTTGGTG AGTGTTTCAT AATCTCTTTA CACCATTGCA 
CAGTTTCATC TTCAACTTTC TCTAAAGGTA CCACTGTATT TACTAGACCC ATATCTAAAG 
CTTCTTGTCC ATTGTATTGA CGACATAAGT ACCAAATTTC ACGTGCTTTC TTATGTCCAA 
CGATACGTGC TAAATATCCT GAACCATAAC CCGCATCAAA TGAACCTACT TTAGGACCAG 
TTTGTCCAAA AATAGCATTA TCAGCAGCAA TCGTTAAGTC ACAAACAACA TTTAGTACAT 
TACCGCCACC TACAGCATAA CCTTTTACCA TCGCGATAAC CGGTTTTGGA ATAATACGAA 
TTAAACGCTG TAAATCTAAT ACATTTAAGC GAGGGATTTG GTCTTCACCT ACATAACCAC 
CATGTCCACG TTTCTTCTGG TCACCACCAG AACAGAATGC TAAATCACCT TCACCAGTTA 
ATACGATAAC TGAAACGTTT TGATCATCAC GTGCACGTGA AAATGCGTCA ATCATTTCAG 
CAACTGTTTT AGGTGTAAAC GCATTGCGTA CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 
TCCCTTCGTA AAATTCATAT TTGATTTCAT CATATTCTCT AAGTGTTTCC CATTGTCTGT 



840 

900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 



55 



<iOOaO: <EP 0786519A2_L> 



1193 



EP0 786 519 A2 





ATTAGGGGCA GATTTATTAA 


AAGAAACTTT 


ACCATCTATT ATAGAGGGCA 


caaatgaaag' 


84 0 




CGTACCTCAA GATGATACGC 


AAGCAACATT 


TGCTTCCAAT ATTCGACGCG AAGATGAGCG 


Qn rv 


5 


AATTAGCTGG AATAAACCAG 


GAAGACAAGT 


GTTTAATCAA ATTCGTGGAT 


TATCACCATG 


U 




GCCAGTTGCT TATACAACTA 


TGGATGACAC 


TAACTTGAAA ATATACGATG 


CTGAACTCGT 




10 


TGAGACTAAT AAGATAAACG 


AGCCTGGAAC 


CATTATAGAA ACX3ACTAAAA 


AAGCCATTAT 


J- U 0 U 




TGTTGCTACA AATGATAATG 


AAGCTGTTGC 


AATTAAAGAT ATGCAATTAG 


CTGGGAAAAA 


A X ** u 




GAGAATGTTA GCTGCCAATT 


ATTTAAGTGG 


TGCGCAAAAC ACACTAGTAG 


GGAAGAAACT 




IS 


TATATGATAG AAAACGTGAG 


AAGTCTTGCT 


TTTGACACGA TTCAAGATAT 


ATTAAATGAA 


1260 




GGTGCGTATA GTAACTTGCG 


TATCAATGAA 


GTGTTGTCAG AAAATGAATT 


AAATGCAATG 


1320 




GATAAGGCTT TATTTACAGA 


AATTGTCTAC 


GGAACCGTTA AAAGAAAATA 


TACGTTAGAT 


1380 


20 


TTTTATTTA 








13 8 9 




(2) INFORMATION FOR SEQ ID NO: 326: 







(i) SEQUENCE CHARACTERISTICS: 
25 <A) LENGTH: 2746 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doub 1 e 

(D) TOPOLOGY: linear 



30 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 





TTTGCTAATA 


ACAATAAAGC 


CAAAGCCGAT 


TCACACTCTA AACAGCTAGA 


AATTAATGTT 


60 


35 


AAGAGTGACA AAGTACCTCA 


AAAAGTAAAA 


GATCTAGCAC AACAACAATT 


TGCTGGTTAT 


120 




GCTAAAGCAT 


TAGATAAACA AAGTAATGCA 


AAAACTGGTA AATATGAACT 


TGGCGAaCTT 


180 


40 


TTaAAATTTA 


TAAATTTAAT 


GGTGAAGAAG 


ATAATAGTTA CTATTATCCA 


GTTATAAAAG 


240 


ACGGTAAAAT 


TGTTTATACT 


TTAACACTTA GTCCTAAAAA TAAAGATGAT 


TTAAACAAAT 


300 




CAAAAGAAGA 


CATGAATTAC 


AGTGTTAAAA 


TTTCAAACTT CATCGCTAAA 


GATTTAGACC 


360 


45 


AAATTAAAGA 


TAAAnATTCA AATATCACTG 


TTCTTACTGA TGrAAAAGGG kTTTATTTTG 


420 




aAGAAGATGG 


cmAAGTTAGA 


TTAGTAAAAG 


CTACGCCTCT ACCTGGTAAT 


GTAAAAGAnA 


480 




AAGAAAGTGC 


TAAAACAGTT 


TCAGCAAAAT 


TGAaACAAGA GTTaAAAAAT 


ACAGTAACAC 


540 


SO 


CTACTAAAGT 


TGAAGAAAAC 


GAaGCGATrC 


AAGAAGATCA AGTTCAATAT 


GAAAATACAT 


600 




TAAAAAACTT 


CAAAATTwGA 


GaACAACAAT 


TCGATAACTC ATGGTGTGCA GGATTCAGTA 


660 




TGGCAGCATT 


ATTAAATGCa 


ACTAAAAATa 


CAGACACTTA TAATGCACAT 


GATATTATGC 


720 
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TCTCGTACTA AATATTGGCT AGTATTTTTT TAATTAAATT GTCTTCTTAT ATCAACTTTT 444 0 

TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAGAAGTAAA 4 500 

CCCATAGCAA TACTTAAACT AATCATTGAT GATCCACCAA AGCTGATAAA TGGCAATGGC 4560 

ACACCAGTTA ATGGAATTGT TGCCX3AAATA CCGCCAATGT TTACAAACGT TTGACTTCCA 4 620 

AAGTATGTGG CAATCCCAAC ACACACAAGT TTATAAAAAT ATGATGATGT TTTATTTGCA 4680 

AACTGGAAGG CACGATATAC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 474 0 

AAACCTAATT CTTCGCAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4 800 

CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4 860 

AGCAATGAAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 4 920 

GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4 980 

ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTTGTT 503 0 
(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 25: 

CTTGTTAATC CGAAAATTAT TAGTCAATCA AATGAAACAA TAACAGACTT nGAAGGTTCA 60 

25 ATTACATTGC CAGATGTTTA CGGCGAAGTG ACAAGAAGTA AAATGATAGT TGTCGAAAGT 120 

TATGACGTCA ATGGGAACAA AGTTGaACTA ACTGCACATG aAGATGTAGC AAGAATGATT 180 

TTGCATATTA TAGATCAAAT GAACGGTaTC CCTTTTACAG AACGTGCGGA CCGTATTTTA 240 

ACAGATAAAG AAGTGGAGGC ATATTTTATA AATGACTAAA ATAATATTTA TGGGTACACC 300 

AGACTTTTCA ACAACTGTTT TAGAAATGCT TATTGCAGAA CATGATGTCA TTGCAGTCGT 360 

AACGCAACCA GATCGACCTG TTGGACGTAA ACGTGTTATG ACACCACCAC CAGTTAAAAA 420 

AGTTGCAATG AAATATGATT TACCTGTATA TCAACCTGAA AAATTAAGTG GATCAGAAGA 480 

ATTAGAACAA TTGCTTCAAT TAGATGTAGA TTTAATTGTA ACTGCTGCTT TTGGACAATT 540 

ATTACCTGAA TCATTGTTGG CATTACCAAA TCTTGGGGCA ATTAATGTAC ATGCATCATT 600 

GTTACCGAAG TATAGAGGTG GTGCACCAAT TCATCAGGCA ATTATCGATG GTGAACAAGA 660 

AACCGGCATA ACAATTATGT ATATGGTTAA AAAATTAGAT GCGGGTAATA TTATTTCGCA 720 

55 



45 



50 
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TTAACACCAC GAATACGCAT TTCTCGTAAT GAGCGTACCA ITrTTTCTTC TGCTTGTTTA 
AATCSATATCG CGTGTGTAGA TAATTTTACG AGTAATCAAT CATAATAAGG TGATATCTCA 
* GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAATACGA 
CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 
CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTrGTCTCT TAACAATATC AAITCCTGTT 
ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 
,^ TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTCCATTGA CATATTTAAT ATlTrCCATC 

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATCTTG ATGATAATCC AACTCATGGT 
GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTCAAC AATCACGTTC AAATAAGTGT 
20 ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 

TATCTTTCTA TGTAAACTTC ACTATTACCA AATCATmr CAGCTTCTGA TTTTGCTCTA 
TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 

2S 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTGCAAATTC TTTTGCTAAT 
TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 
GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 
AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTCTT CATTTTCACT TAAAAATCCA 
TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 
35 TTTAAATAAC TTTCAGCAGG ACCTAAATCA CTTCCAACTA AATAGGA1TC ATCTGCTTTA 

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 
TCTGCCX3CCG CTCTGAATAT ACGAATTCCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 

40 

TTTATTTGTT TCAATAGCGA TACACTCCTC AAACTATTAG AATTTTCTAA CTAATTAGAT 
AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTrCmAA TITTCAAAaT 
TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATGACTA 
TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 
AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 
so TATCATCATG ATTCTAATTT CGCCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 
AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 
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CGTTTACCTT 


TATCGATTTC 


GATTTCTACT 


GTTTCACCAT 


TACGCATTCC AAAGAAGAAT 


840 




GTAGGCGTAT 


CAAGTAACGA 


TAAGTTTCCG 


TATTGATTTC 


TAGTTTGAAT ATATTGTTCA 


900 


5 


TATACTTTTG 


GATATAGTAC 


ATAACTAATA 


ATATCTTGCT 


CCGTAACAGG ACCTTGTTGC 


960 




TCTTCTTCAA 


GCAACTCACG 


GACTTTTTCA 


AAATCAACTG 


GCTCTAGATA TTCACCTGGA 


1020 


10 


CGAGCTGTTA 


GTGCTTCTTG 


GCCTTTTAAA 


ATAACCGCTT 


GTAAATCTTT ATTAAAACCA 


1080 


TTTACAGGTT 


GTCCTATTTC 


ACCTTTGAAG 


AACGACACTA 


CTGATTCTGG GAAATCTAAT 


1140 




TTATAGCCAT 


CTGTAATCAC 


GGATTGTTCA 


TCAAGATCAT 


TTTGTACCAT ATAAAGTGCC 


1200 


IS 


ATATCACCAA 


CTACTTTAGA 


CGATGGTGTT 


ACTTTTACGA 


TATCACCAAA TAAGAAATTC 


1260 




ACTCTGCGAT 


ACATATCTTT 


GACTTCATCA 


AATCTTTCGC 


CTAAACCTAA ACTTTTAGCT 


1320 




TGTTGACTTA 


AATTCGAATA 


CTGTCCACCA 


GGCATTTCAT 


GTTGATAAAT TTCAGTATTC 


1380 


20 


GGTGATTTGA 


TATCACTTTc 


AAAGTCTGAA 


TAATAAGTAC 


GTACAGTTGA CCAATAATGA 


1440 




CTAAGTGACT 


CCATACCTTC 


T^TATCAGTT 


CTAAGGTGGC 


GTGGGAAGCC ATTTAATGCA 


1500 




TAATATAACG 


AATTGGCGCT 


TGGCTGACTT 


GTTAAACCAC 


TCATTGAAGC AACAGCAGTA 


1560 


25 


TCAATGATAT 


CGACACCAGC 


ATCTATTGCT 


TGTTTGTATG 


TTAATAAACC ATTACCACTT 


1620 




GTATCATGAG 


TGTGAAGATG 


AATTGGTAAA 


TCTACAGCTG 


ATTTTAACTC ACCAATCAAT 


1680 


30 


TCGTAAGCGG 


CTTTAGGTTT 


TAATAAGCCT 


GCCATATCTT 


TAATCGCTAA AATATGGAAA 


1740 


CCTTCACGTT 


CTAACTCTTT 


AGCTAGTTTG 


ACATAATACT 


CTAAAGTATA AATGTTTGAT 


1800 




CX5CTCAGGAT 


TTAAAATGTC 


ACCTGTATAA 


CAAATAGTAC 


CTTCTGAGAT TTTGCCCGCT 


1860 


35 


TCTTGTACTG 


CTTCATTGGC 


AACTTTCATT 


TGATCTACCC 


AGTTTAATGA ATCGAAAATT 


1920 




CTAAAGACAT 


CTATGCCTGC 


TTTAGCACTT 


TCTTGTACGA 


ATTTATGAAT AACATTATCA 


1980 




GGATAGTTTT 


TATAACCAAC 


TGCGTTTGAA 


GCACGTAACA 


ACATTTGGAA TAATACATTT 


2040 


40 


GGAATAGCTT 


TACGTAGACG 


TTCAAGTCGT 


TCCCATGGGT 


TTTCCTTCAA GAAATTATAT 


2100 




GCCACATCAA 


ATGTAGCACC 


GCCCCACATT 


TCTAGTGAGA 


AACCATCTTT AAATACGTCC 


2160 


45 


GCTGTTTTGG 


ATGCGATATT 


AATCATATCC 


TTAGTTCTAA 


CTCGTGTAGC TAATAATGAT 


2220 


TGGTGTGCAT 


CTCTAAAGGT 


TGTATCTGTT 


AGTAAGACAT 


CATCCTGCTT TTTAACCCAT 


2280 




TCAGCTACAC 


CTTTTGGACC 


TACTTCATCA 


AGCAATTGTT 


TCGTACCACT AAATGAAGCX3 


2340 


50 


ATTTTACTTG 


AAGATACAGT 


TGGAATTGAT 


GCTAATTCAT 


AGTCTGGTTT CGGACGTTTC 


2400 




TCAACATTTG 


GGAAACCATT 


AATTGTTACA TTACCTATAT 


ATTCTAATGT TTTAGTACCT 


2460 




CTATCTAGAG 


ACGGCTGAAT GTCGAAAAGT TCTGGTGTTT 


CTTCAATAAA TTTAGTTGTG 


2520 
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GATAGCTTTC TTTGTTCGCT 


GGAACJVTYSTA 


aavjAAATAAT 


ATCGGCATCT 


TTAATAGCTT 


420 


CTTTAACACT ATCTTTATAA 




V- lAAATCTTT 


ATTAGGATAG 


GCGTCATAAG 


480 


CTGTAATTGT AGCACCAAAT 




i c 1 1 AGCTGT 


AGCAGCACCG 


ATACGACCCG 


540 


TACCAATAAT TGCAACAGTC 


ATATm*T A A 


1 orOTTTAGA 


CATGATTTCT 


GCTTGCCAAG 


600 


TAAAATCATG TGCTTGTACA 


CGGCGTTP A A 


iAl CIGGGAA 


GCGACGCACT 


AATTGTAGGG 


660 


CGATAGArAC AGAATACTCT 


GGAATtGTTT 


w\vivj X QjAATA 


ACTAGGAACG 


TTAGATATCA 


720 


CAATATTGTG TlTTTTAGCT 


AAATP*T*A ART 


CATACATATC 


AAATCCAGCA 


GTACGTTGTG 


780 


CAATTTGTTT AATACCGTAA 


GATTCTAATT 




GTCATTTTCT 


AACTTACCAA 


840 


ATTGCATTGT AGTTACGCCA 


TCGTAATCTT 


TTAATTGAtC 


GACTGTAGCA 


CTTGATAATA 


900 


GCTCTTTAGA AGTAGTTACT 


TCGACATTAT 


TCTtTtTCCC 


CCAATTTAAT 


GCCATCTCTT 


960 


TCTCATAATC ACGCGTaCCA 


AAGAACATAmA 




TAT 








X \-j-ix 




1003 


(2) INFORMATION FOR SEQ ID NO: 324: 









(i) SEQUENCE CHAJtACTER I STI CS : 

(A) LENGTH: 503 0 base pai 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


324 : 






GGCTTTTATA 


GTGTTTACGA 


TTATTATGAT 


TACTTATATT 


CACGCTGTTA AAAATTATCC 


60 


35 


AAATAACCGT ACTGTTCATT 


ATGGTTATAC 


AGCTGCATTT 


ATACTTGTTA TATTACAAGT 


120 




TATCACAGGT 


GCATTATCTA 


TTATGACAAA 


TGTTAACCTG 


ATAATCGCAC TTTTCCATGC 


180 


40 


ATTATTTATC 


ACTTATTTAT 


TTGGTATGAC 


AACATACTTT 


ATCATGCTAA TGTTACGATC 


240 




AGTAAGAAGT 


GACArGCAAT 


AACaAAAAAG 


cmAACCGTAA 


TTTTAATGGC ACGCCCATTA 


300 




AAATTACGGT 


nTTTTATATC 


AATATTTAAA 


AtTAAACCtA 


AGCCATGTAA AAACGAGATT 


360 


45 


ACACGTCAAT 


TGTTGTGTAA 


TCTCGTTTTA 


TnTTAATCAT 


TTTAGTCAGT TGCTTTTTCA 


420 




ATTTCGATTA 


ATAAATCGCC 


TGTCGCTATT 


GTGTCACCAT 


TATTTACAGT TACTTGTTTA 


480 




ATCACACCGT 


CAAATGGTGC 


TTGAATTGTT 


GTTTCCATTT 


TCATAGCTTC AGTAATTAGC 


540 


SO 


AACGGCTGAT 


TAGCTTTCAC 


AGTTTCACCT 


ACACTAACCT 


TGACTTCAGT TACTGAACCT 


600 




GGCATTTGAG 


CACCGATATG 


ACTTGGATTA 


CTCTTATCTG 


CTTTTGGCTT AACGTTCGCA 


660 


55 


TTTGTATGCA 


CATTTTCATC 


TTTAATGTAA 


ATACGTCTCG 


CTTGACCATT CATCGCATAG 


720 
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TGTAGTTGTT GAAAATATTT ATCGACGCTT AACAGATTCA GAAGAACAAC TAAAAGGTGA 300 

AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT CGACACTAGT 360 

TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 420 

ACCTTTTGCA TTGGCTATTG CATTTAGTTT ATTAGCATCG TTATTAGTGT CAATTACACT 480 

CGTTCCAGCG TTGGCAGCTA CACTATTTAA AAAAGGCGTT AAACGTCGTA ATAAACAACA 54 0 

TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GTATTACATT GGTCATTAAA 600 

TCATAAGTGG ATTGTAATTA TATTAAGTAC ATTAATTTTG GTTGCAACTA TTGTATTTGG 660 

AGGACCGAGA CTAGGCACTA GCTTTATTTC AGCAGGTGAC GATAAATTTT TAGCTATTAC 720 

TTATACACCG AAGCCTCGTG AAACGGAGCA AGCAGTGTTG AATCATGCGA AAGATGTTGA 780 

AAAATATTTA AAACAGAAAA AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 840 

TCCAGTAGAT CCAACGGGTA GTACAAATAG TATGGCAATC ATGGTTGAAT ATGATAATGA 900 
CACGCCTAAT TTTGATGTAG AAGCGGATAA GGTTATTAAA CATGCAGATG GCTTTAAACA 960 

TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 1020 

TACTGTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT GTAAAAGATA TTGAACAGAA 1080 

AATGAAACAG GTTAAAGGAC TAGCCAATGT CAAATCTGAT TTATCGCAAA CATATGATCA 1140 

GTATGAAATT AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 1200 

TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT ACGACTGTTA AAGAAAATGG 1260 

TAAAACTGTT GATGTTAAAG TCAAACAAAA TAAGCAAACA GC 1302 
(2) INFORMATION FOR SEQ ID NO: 323: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 
ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 
TTAAACCACC TTCAACAAGG TTTTGTACTG CTTCATCAGA GAAGAAAGCA ATATGTGGTG 
TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 160 
TAGTCCAGTC ATTTGTGAAG TATGCTGCTT CATTTTCATA AGTATCAATC GCAGCACCTA 
ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCACCACGTG 



60 
120 



240 
300 



1187 
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GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA 
TTATACTCTA AATATAGCAC TTAAGCATCA TTTTTATAAT 
TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT 
ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT 
TCTCTTTCAC CCAGTCTGGT AATCCTGTCG TTTCTTTTCC 
TGTCATGATT TGAAAAATCA AAATCACTAT AAGTCTTTTT 
AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT 
CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT 
GAGGTTTAAT TAAATGTAAA TGTGTGTTTG TACCTGCACA 
ATTAGCTGGG AnTCTGGTT GATATAAAAC GATATGATTT 
TTGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC 
CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTTGACC 
CTCGAATCAA CGGTCTTTCA TCTGTTGTTG CAACTTTTTT 
CTTCATTAAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG 
AAGCACCTGC AAGgTGACCA TATGTTTGTT TGAATTCTTT 
GTACTAATCT TGGCTTTAAA ATTTCTGGTT CCAAAATGAT 
TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA 
TTTTATAACG ATATTGGTCA GGCATATAGC CTTTTGTTTG 
TCTGACTATT TAATTGGCCA TTTCCAACTA ATGCACTTGT 
TTGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG 
(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1302 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAATCGT 
GGCAATTATA GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 
ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 



ACCATTTAAC 
GAAAATGAGT 
TAACAACGCA 
TAACGCTGTG 
AAAAATGAAG 
ACCAAATTTT 
ATCATGATAC 
ATCATCAGTT 
CGTACGgCAA 
GTCATATTAC 
ATCCTCTTGA 
AATGGCCCAA 
CAATTCTGGA 
TATCGGCTTT 
ATTAGACATA 
GTCATCTTGT 
ACCATATAAT 
CGTTAAAAAA 
TGGACAACGA 
GTTCC 



TTTAACATGC 
AAATTTTAAT 
GCAGTATTTG 
TCTTGATATT 
TAAAAATCTT 
GTTAATAAGT 
GTAATATTCA 
CTAAATCCAA 
TGTTACCAGT 
TATTCTCTCC 
TCATAATTAG 
TAAGCAGTTG 
ATTGCATCCA 
TTACCACGCC 
CGTAGtAAAG 
TCGGTATTAA 
CTATTCCCAA 
CTAATGCATT 
TCAACACAAA 
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TACAATACCT 
TTTTGATAAA 
AATTGCCAAT 
TTGAATCCAG 
GCATATCATA 
TGATAGATGA 
TGTTTTAATA 
TTTTATGCTT 
CGGAATGTCA 
CCCAATTTCT 
AAGTCAGCTT 
AATTTCTACA 
TCAGCTTAAA 
TTCTACAAAC 
TTCTATGAAT 
AATCAATCCT 
TATTACACAA 
TGATCAGAGA 
ATACTGTTTT 
ACCTCACAGA 
AAACCTGCAG 
GCTGGTTCGA 
TTAGCACCTG 
ACATTTAACT 
TCTTTAGTTT 
TAAACTTGAC 
AAAATATTTA 
AGAACTGGCG 
AATTGCTTTC 



GATTGATTAC TTTTAATGAA TGTTTGCGCA TTAACATCAT CAATTAATCC 
TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 
GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 
TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 
ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAATATTAC 
CTTCTTTCAA TTATGTCTGG AGTAATTAAT TATCAATTCC GTTTAAATGG 
TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 
TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGGCTT 
GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 
ACAGACATTG CAAATTGGGG AAACX5GGCCA CAAACTCAGA AACTGGTGGA 
AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 
GACATTGCAA ATTAGGGAAA CGGGCCACAA ACTCAGAAGT TGGTGGAAAG 
ATAACATGCA AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 
AATGTAAGTT GGGGAAACAG CCCCAACACT GAAACTAGCA GAAAGTCAGC 
ATAATAAAAA AGCTAGGTAA CAAAATGCTA CCTAACTTCA TATTCAAGAT 
ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 
TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 
TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 
CTTCTTCATG ATAATTTAAT TTTGGATCTT TGTCTTTAGA CATAATCAAT 
TTTTTAATTA AGCAAAGCGT GACGTAAAGT AAGCTTTAAC ATCTTCAGGT 
CCGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAATCG 
AGCTTGTTTT ACCATTTTCT TGATATAATT TTTCAACTAC ATCTCGTTTA 
TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 
TACCTTGCTC ATCGATAGAA ACAACTTGTA ATGTTAATTT CCCTTTATTT 
TAATCTTATC AGCGATTAAT TCGATTGCAT CTTTTTCATA AGCAATTGGA 
CTGCTGGTAC ACCTAACGCT TCGAAATATG ATTTTTTATC GTCATAATCT 
TTTGGCTAAA ATCAACAGCA TGTTTTTCAA CATTTTTCTT TAATTCATCT 
CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 
TAATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 
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CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGGATC TACCATATCT TCTCGTTTAA 
TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTCCAGAT TCTTTTAAAG 
TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTOOAGCTGC TTTrrCATAA CCAATATCTC 
GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA •TrCTCTTCGA 



3240 
3300 
3360 
3420 

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGriTC CATACCATCA GCTAAAAGAT 



3540 
3600 
3660 
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10 --^v,w. v^xn^irAT TAAATGTTTC CATACCATCA GCTAAAAGAT 3480 

AAATTGATTG TAGTGTATTA TGCATAATAA C7XX3TTTATA AACATTCAAT TCAAAGTTAC 
CTTGTGAACT TGCGAACCAA CAACTX3TATC ATTACCCATT ACTIXSGACTG CnACCATTGT 
TAACATTTCA CATTGTGTAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 
TTCAGGGATA GAAATTTCTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 
ATTAGCAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATCCA A^TGAACAAC 
^0 TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGA1X3CGCAT TAATACCAGT 
ACCAACAGCC GTACCACCGA TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 
TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTCTCCTA GTTTCATCGG 
CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 
TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTOTGTTT CAACCTCTOG 
ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 
ATCATCATTk GGGTGGATAC TTTCATCACT TTX3ATGATCT TTTAAATACA TATTAGCAAC 
ATAACTTACT ACTTCGTTCA CATTCATATT ACTrTGTCTA CCGClTCCrG TTTGCCATAC 
AACTAGTGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 
GGCATCTTTC TTTGCCTCGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTCCTCCACG 
CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTCGGAA 
ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATAnTA TCTGCAGGTA CTrCTATTTC 
TCCAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTCTTTAT 
TTAACTGTAG TATATCATTA AATAATTTAA TTX3AGCAATT TATGATTAAA ACGTTTTCAT 
AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 
TACGArrAGT GTGTTATCTA TCATTATTTA GTTAITATTG AACTAAGTTT AATTACGATA 
CTTTGTTTTA GTAGCTTCAA CCGtAGCAAT AGCTCTAAGT ATATATAATA CAGCACTAAC 
AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCIAAA ACTCCTCTTA AACTCGCATA 
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TGACCCGATT TAAGTGGTAC CGTTGCATGA TGTGTAATAT TGTCATTTTG GCTAGGAAAT 
GCTTTAGAGG CTAAGTTAAT ACGAGAACAA TTGACGATAT TCGACGCTTT CATTAGCCTA 
CGTTGATTAA ATGCTTTCAC ACACCAACAA GACCCATCTT TAATATAGTG ACAGTGGTCT 
GCTCTCAAAG ATTGTGGTAG AGCCACATGT GATACAAGTT CATGTTCACC TACGCTATTG 
ATGAAAAATA TCCAGCCTGT CGTGAAATTA CTGCCCTCAA TTAAATATTT TAACGCACCT 
TGGGTCATGC TATACATTTC rGTTTCTTCG TTTAAAAATT CGGCAATCTC TTTTAATAAA 
GCTAGTCGCG TCCTTTGTTC CATCAAATCG CTCCAATTCA TTTTTACGTG TATTAACTAT 
TATACATTGA GTTATTATAT TTTTAAATCT TAGACGTAAA CATGATAAAA TGGCCTTGAT 
TACTCAATAG TTATATTTCG GAGAACTGAT TTGTGATATG ATATTAAAGA CTATAGGAGG 
ATTTTATGAA ATTTAAAATA CCAGAAAACT TTAATGACTT AAGTTTACGA GATATTTTCC 
AACAACTTAA GGTACCTAAA AAAGATTTAC ATCATTTAAA TATGTCTAAA GATATTACTA 
TTAATGATAA ACCTGCGCXSA TTAATGGATA AAGTGCATAC TGGCGACGAT GTATTTGTTC 
CAACCATCGA TGAAAAAAGT AATTATGTTC CAAGTTATCG TTATGCACAA ATTAAATACG 
AAGACGATGA TATGGCAATC GTAATGAAAC CTAAAGGTGT TAAGACTCAC CCTAATGATT 
TAAAAGAAAG CAATACTTTA ATGAATCATG TGATTTACAC TATTGATAGT GACTATGTCG 
AACCAATTCA TCXy^CTGGAC CAGGAAACAG TAGGATTATT AATTGTTGCT AAAAATCCTT 
TAATGAAAAA AATTCTTGAT CGCATGTTAG AAGACAATGA TATTACGCGG ATATACAAAG 
CAAATGTTAA GGCACTTTTA CCTTTAAAAC CACAAACGAT TGATATGCCA ATTGGTAAAG 
ATAAATTCCA TTCGAATAAA CGACGTGTGT CTCCTACTGG ACAGCGTGCA ATTACACACA 
TTTTAACTTC AAAAATGATA AAAGAAGCTG TGTGCCtnACT TGAAATCAAG TTGOATACTG 
GACGTACTCA TCAAATwCGT GTGCATTTAG CTGAAATTGG TCACCCTGTT ATTGGTGATC 
CTTTATATGG TGATTCAACG TTAAGACAAT TAGAACTTGA AAGTTACAAA ATAGAGTTTG 
TGCATCCCTT GACTAAGGAA GTCATTTCCG TTTCTTTGGA TGACTAATTT GATTAGTTTT 
GCATGATATT tAAACATGCA ATACCGCATT GTAACTAAAT CAAGTATCAA CtTAAAcGGA 
TAGATGGAAA ATTATTAATT TTTTCAGATG TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 
GATTTTATAT TTTGATATTT TAGTAATTAT TCATTTTATA ACATCCTTGG ATAATGACTT 
GTAGTCTTTT TCAACTGCGT TACGTGTATC TATGGACAAT ACATGACATC ATAAGATTTT 
TATCACAGGT TGTTTGGCCA ATACATGTAC AACAATTCAT CATATAAAAA ATAGGTTCTA 
TAATAAAACG GACTCCATGA AAAGTTTT T C CTTTTCATGG CTCTATATCA AATCAGACTT 
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5897 

(2) INFORMATION FOR SEQ ID NO: 321: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 





(Xi) SEQUENCE DESCRIPTION: 


SEQ ID NO: 


321 : 






IS 


AL-ftiiL-iAAA TCAACTTATT 


TTCCATTGCA 


TAAATTGCTG 


CTTGTGTACG 


ATCGCTAACT 


60 




luxMAi i lAAATATATG 


ACTGACATGT 


GTTTTAATTG 


TTITI'TCAGA 


TACAAATAAA 


120 




o X 1 1 i„ 1 J, c rC ill ATT 


TGTTTTACCT 


TTAACCATTT 


CACGTAACAC 


TTCAATTTCT 


180 


20 


i i 1 i\yACA ACTTATTCGT 


GTAGTGTGGT 


TTTTGGCTAA 


CTGTTTCGAA 


TACATCTTGT 


240 




^^-U-i lAvivjAT GTATCATTTT 


TTCACCGTTC 


ATAACTCGTC 


TAATAGTTTC 


AATTAATTGC 


300 




x*j>\ovjV_il.aa CGTCTTTCAT 


TTCATAACCA 


TCAGCACCTT 


TATTGATTGC 


TGAAATTACA 


360 


2S 


J. vj 1 X X ^-M X ^.-AAClft, X AACT 


TGTTAATACC 


AAAACTTTAA 


TATCCGGATA 


ATGTGCCTTA 


420 




'^•'•'^A'** xv-*,^\j XAAx X X^C-AAX 


ACCATTCATG 


CCAGGCATCA 


CTAAATCTAA 


TAGCACAATA 


480 


30 


TCAGGGTGr*T' d^Tr^r^r^T^i - 1 * 
* ^— '^vjvTVj X \jv» X v.-rt.x\jt- X ^ i iTT 


TAAATATTCT 


AAAAATGTTT 


CTCCATCTGC 


AAAGTCTTGT 


540 


AAAACnrOTA 'TY^T*T»*T»Toi\ -^*Ti 

rwLTW, ilV-lrt XVjXXII L-Aa X 


CGTGGaTAAT 


AAAAATCGCA 


ATCCTTGtCG 


CACAATATAA 


600 




kGGwCATCTA CTAATaTnar' 


TTTGTTCATG 


GGTTATCTCC 


TTAAAtCAAG 


CTATTTTATA 


660 


35 


GGAATTGTGA ATTGTATTTG 


TGTACCCTTT 


GTTGGCTGAG 


AATGAAAGGT 


CACTTTACCT 


72 O' 




CTTAATAATT TAACTCTTTG 


TTTTATGTTA 


TTAATACCGT 


GTGATGAAGC 


TATCTGAACA 


780 




TTATCGATCT CAAATCCTTG 


ACCATAATCA 


ATCACGTCAA 


TATATAGTAT 


ATCGTTCATT 


840 


40 


TGTTTTAATG TAAGATCCAT 


TTTATTCGTA 


TCAGCATGTT 


TCTTAACATT 


ATTAATACAC 


900 




TCTTGTAATG CTCTGTATAT 


GTTTTCTTCG 


ATTTCATTAG 


ATAAATCGAT 


TAAACCTTCT 


960 




ACATTTACAT TTAATTGTAT 


ATGCATTAAT 


TTACTATATG 


CTGTCAAAGC 


ATGAATTAAA 


1020 


45 


CCTTGCTCAA GTCCAACTGG 


CTTAAGTnGC 


CAAATCAATG 


CACGCATTTC 


ATTAACGGCA 


1080 




TTTTGACTCG TTtCCTCAAT 


CGTCTTGAAT 


GCTTGTTTAG 


CGATGGATTC 


GTTTGACATG 


1140 




CCATACGCAG CATGlXSCtGT 


TAGTTTTACA 


GAAAATAACA 


TTTGATTTAC 


TGAATCATGT 


1200 


50 


AAATCTCTAG CTAAACGATT 


ACGTTCATTA 


ATTTTTGCCG 


CTTCTTTTTC 


ACGGTCTGTT 


1260 




AAATAAATAC GTTTGATGGC 


TGACCCTAAT 


TGAAATGCGA 


CAGACTCTAG 


CAACTCTAAA 


1320 
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TGACTAAAGG TAAAGAAGTA C3AAAGGGATG GCTTCCAAAT TAGACGTACT ACAGTTGAAA 
ATGGTATTCC ATXSTTTGACA TCTTTAGATA CAGCTAATGC CTTAACGAAT GTAATTGAAA 
GTATGACATT TACAATGCGT CAAATCTAAA TCAATCAAAC TGTATCGGTG GGGCTGTAAT 
TAACCATTTA CTTAAAGAAG TTTATATTAC AGCCTCATTA TTTTAATGAA TTTCTTAATA 
TAAAGGGAGA CaTATATGAT GAAAGATTTA CCAATTATTG CATTAGATTT TGAATCAAAA 
GAAAAAGTAA ATCAATTTTT AGATTTATTT GATGAATCAT TATTCGTAAA AGTAGGTATG 
GAACTTTTTT ATCAAGAAGG TCCTCAATTA ATTAATGAGA TAAAAGAAAG AGGCCATGAT 
GTATTTTTAG ATTTAAAACT GCATGATATT CCTAATACAG TTGGTAAGGC GATGGAAGGA 
CTAGCTAAAT TGAATGTTCA TCTGGTAAAT GTTCATGCTG CTGGTGGCGT AAAAATGATG 
TCTGAGGCCA TTAAAGGATT AAGAAAACAT AATCAAGATA CAAAAATTAT TGCAGTAACA 
CAGCTTACGT CAACAACAGA AGACATGTTA CGACACGAAC AAAATATACA AACATCGATT 
GAAGAGGCCG rTTTAAATrA TGCCAAGTTA GCAAATGCAG CTGGTTTAGA TGGCGTTGTT 
TGTTCACCTC TTGAAAGTCG TATGTTGACT GAAAAGTTAG GTACATCATT TTTAAAAGTA 
ACACCAGGTA TTAGACCTAA AGGTGCATCT CAAAATGACC AACACCGTAT TACGACACCG 
GAAGAAGCAA GACAGCTTGG TTCGACGCAT ATTGTAGTCG GTAGACCGAT TACACAAAGT 
GACAATCCAG TCGAAAGTTA TCATAAAATT AAAGAAAGTT GGTTAGTATA ATGGCTAAAG 
AAATTGCAAA ATCATTATTA GATATTGAAG CTGTAACATT ATCACCAAAT GATTTATATA 
CATGGAGTTC AGGTATTAAA TCACCGATTT ACTGTGATAA CCGTGTTACG TTAGGTTATC 
CTTTAGTTCG AGGCX3CAATC CGCGATGGTT TAATTAACTT AATTAAAGAA CACTTTCCTG 
AA6TAGAAGT TATTTCTGGT ACTGCAACAG CTGnTATTCC ACATGCAGCT TTTATTGCTG 
AAAAATTAAA ATTACCAATG AATTATGTTC GTTCATCAAA TAAGAGTCAT GGTAAGCAAA 
ATCAAATCGA AGGTCCTAAA AGTGAAGGTA AAAAAGTAGT TGTGATAGAA GATTTAATTT 
CGACAGGGGG ATCTTCAGTC ACAGCAGTTG AAGCCTTAAA ACTAGCAGGT GCAGAAGTAT 
TAGGTCTTGT AGCTATCTTT ACTTACGGTT TGAAAAAAGC AGATGATACA TTTAGCAATA 
TTCAACTACC TTTTTACACT TTAAGTGATT ACAATGAATT AATTGAAGTA GCTGAmAmTG 
AAGGTAAAAT TTCTAGTGAA GATATCCAAA CATTAGTTGA ATGGAGAGAC AACTTAGCAT 
AATATAGACA CTAGAAGGAG GAATTCAACA AATGAATGAC AAAACATCTA ATGATTTATA 
TGGGAAGATA AAACATTGTA ACGAATTTAT CAATCATTCA AATGATTCCA ATCTATCTAG 
TAGTCACGAT GTCGACGAAA GTTCAACGAA GCAAAAACAT ATAAAAAATA AAACAACTAT 
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TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 
GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 
ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTC 
AGCATCAATT AAAAGAGCAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATC 
GATTTAGTGA TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 
TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTCAAT 
TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 
,5 CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTSTAG 

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 
TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 
?0 ACTTTGAACC TTTAACTGAA GAAGATGTCA TGAATATCAT TAATTTAGAA AAACCTAAAC- 

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AGCAGACAAA TTGGCTAAAC 
ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTCCTGAA GATAGAAAAG 
AATTTGAAGC ACTATTAAGA AAAATTAACG TCCCACAGCC ACAAGGGAAA ACAGCTACAT 
CACCTGAGGA AGCATTAGCG AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 
CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 
ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 
CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 
TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 
CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTTGAGGA CTATACTATA AAATTAGCTA 
AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAATTCGT TATAGCTCAC GATGGTCTGT 
ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 
ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTCG GGAAAAACTA ACAGATATGG 
GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 
Gn-TTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TCAAATGAAG TCAACAGGTG 
AAGTGATGGG GAAAGATACT ACATTAGAAA AGGCGTTATr CAAAGGGTTA ACAGGTAGTG 
GCGTTGAAGT TAAAGATCAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 
AAGTTGTTAA ATTGGCACAA CGCTTAAATC AAGTTCGCTA TAAAATTTTA GCAACGTCTC 
GAACAGCTAA TAAATTAGCT GAGTATGACA TACCTCCAGA AGTAGTAGGC AAAATTGGTG 
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AGTGTCGTAC TTGTAGACTT TCGTAAGAAG CAAAATATTG TTCGAGAATT AAACGTCAGA 
GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 
CCAGATGGCG TTATGCTATC MACGGACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 
CCAATGATTC AAGGAATTTT AGGGAAAATT CCGTTCTTTG GTATCTGTCT AGGACATCAA 
CrrTTTGCAT TATCTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCG 
AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 
TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 
AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 
CATCCTGAAG CAAATCCAGG ACCGTCAGAT TCAAACTATC TATTTGATGA TTTTGTAGCA 
ATGATGACTA ATTTTAAGGA AAAGGAGCGT CATATCAATG CCTAAACGTA ATGATATCAA 
AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 
TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 
TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 
GTTAACTCAT GATTTTATAG CGCGAATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 
AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 
TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 
CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTACCAG AGAGTGACAT 
TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 
TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 
ACATCAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 
ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CGATAATGCC 
ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGirGGTA TTCATACAGG CGATTCAATT 
GTrGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 
TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGTA ATGTTCAATT AGCATTAGAT 
CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACG TTCATCAGCG 
TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 
CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 
ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 
GAACGAGAGC TTGGCACACA AATCAAAGCA ACAGGTGAAG TTATOGCCAT TGGTCGAACT 



540 
600 
660 
720 
700 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


319: 








AAT ATTG a Ta 


TTATATATAG TCGAATCaAT 


GATGACCCaC 


ATCGACCtAA 


TGCAAATAAT 


60 


5 


AAGACAATCA 


AGCAGTTAAA TGACTTGTAT 


TGCTCGCGGA 


ATTTAAGATA 


TAATGAAATC 


120 




GCATTCAAGA 


AATTTGATAG TCATTTGCTA 


TCAATTTCAG 


AAATATATTA 


TGAATTGCTA 


130 


10 


AATTTAATAA 


AAGCGAGTGA TCAGTATTAG 


AGAGAATAGA 


GCGTTAAGAC 


TCTATCGCCG 


240 




AAGtGCAAGT 


AATTTATTAC GAAACTCTCA 


GGcAAAAGGr 


TAATACTGTA 


ACGCGTTCCT 


300 




GAATTGGTGA 


TTTATAAACA GGGTAGCGAT 


TGCTATCCTG 


TTTTTATAAT 


TTTAAGGGGG 


360 


IS 


TATTTCAATG 


TCAAGTGATT TAAAACAAAC 


ACCTTTATAT 


CAAAATTATG 


TTGATAGAGG 


420 




TGCAAAAATT 


GTGGAATTCG GAGGATGGGC 


GATGCCTGTT 


CAATTTTCAA GTATTAAAGA 


480 




GGAGCATAAT 


GCTGTTCGAT ACGAAATTGG 


CCTGTTTGAT 


GTTAGTCATA 


TGGGTGAAAT 


540 


20 


TGAAGTAACA 


GGTAAAGATG CTAGTCAGTT 


TGTGCAATAT 


TTATTATCAA 


ATGATACTGA 


W W V 




TAATTTAACT 


ACTTCAAAAG CATTATATAC 


TGCTTTATGT 


AATGAAGAAG 


GCGGTATTAT 


660 




TGATGATTTA 


GTAATATATA AATTAGCTGA 


CGACAATTAT 


TTATTAGTTG 


TTAATGCTGC 


720 


25 


TAATACTGAA 


AAAGATTTTA ATTGGATTTT 


AAAACACAAA 


GAGAAATTTG 


ATGTTGAAGT 


780 




ACAAAATGTA 


TCAAACCAAT ATGGTCAATT 


AGCAATACAA 


GGACCAAAAG 


CtAGAGATTT 


840 


30 


AATTAATCAA 


TTAGTTGATG AAGAkGTAAC 


TGAAA 






875 


(2) INFORMATION FOR SEQ ID NO: 320: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 



45 



50 



TCTi-rrrACG 


AAGGCTACCG 


TTTAGGATCT 


GATAACTTAA 


CTGTAGGAGA 


AATTGTATTT 


60 


AATACAGCGA 


TGACAGGTTA 


TCAAGAAACT 


ATTTCAGATC 


CATCATATAC 


AGGTCAGATC 


120 


ATTACTTTTA 


CGTATCCATT 


AATCGGTAAT 


TATGGTATCA 


ATAGAGACGA 


TTTTGAATCA 


180 


TTAGTACCTA 


CATTAAACGG 


TATTGTAGTG 


AAAGAAGCGA 


GTGCGCATCC 


AAGTAATTTT 


240 


AGACAGCAAA 


AGACACTTCA 


TGACGTTTTA 


GAATTGCATC 


AAATTCCAGG 


GATTGCAGGT 


300 


GTTGATACAA 


GAAGTATTAC 


GCGTAAAATT 


CGACAACACG 


GTGTGTTAAA 


AGCTGGTTTT 


360 


ACTGATCGAA 


AAGAAGATAT 


TGATCAACTT 


GTCAAACATT 


TACAACAAGT 


AGAATTACCT 


420 
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ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 


3120 




AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATT TGTAATTCCG 


3180 


5 


ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 


3240 




ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 


3300 




TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CAACTGCAAT TACAGCAAAT 


3360 


10 


GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCAl'i-rrr AAATGCTGAT 


3420 




TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 


3458 


IS 


(2) INFORMATION FOR SEQ ID NO: 318: 




20 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH* 695 base oalrs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 




25 


GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT TATGAATATG TTCCAAAATC 


60 




ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 


120 




TATTTGCTAA ATTGTTACAT ACGATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 


180 


30 


ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 


240 




ATCATTTACA ATGATTGCGT ATAGTGATGT GTCTTTTTTT GTATTATTTA TCTTCGGACG 


300 




ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 


360 


35 


CAACTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATTTGTTGA TGCGTGAAAT 


420 




ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 


480 


40 


TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 


540 


CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT i-i i l iATTTT 


600 




TX3TAATCTTG CTTAGAAAAT AATTTAACTT CCTCTTCTGT TTyCTTACGT CCAGGTAATG 


660 


4$ 


GAATATCAAA TTyCAAAATT AAGAATCTGA ATAAG 
(2) INFORMATION FOR SEQ ID NO: 319: 


695 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 875 base pairs 
so (B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : double 
( D ) TOPOLOGY : 1 inear 
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AACGTACAGA GCTTGTTAAG TGTTTAGATT ATTTACGAGA GAGCGACACA rTAGTTCTCT 
ATCAACTTGA TCGGTTAGGT AGAACGACAA AACTArraAT TGAATTaTCA CAATGATTCG 
ATGATAACGG AATTGACTTA CAAATTAGTA ACATGAACAT TTCAACGAAA GACACAATGG 
GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAGAAGTTAA TTTACTATGT 
GAGTGTATAA AATAGACTTA GCAGCAACAA GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 
CTTTACCAGA GAATAAAAAA TGAGAAATTA AATTTTTATA TCATCAACAA ACGATAACAG 
GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTATCGA GTTGATTAAG 
AAATGAAAAA ACTTATACTA TGAATTACTG TTTAAAAGTG TCCATGTTAT AATATTTATT 
GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTCATCCCT ATGGTTATAG 
TTGTTACTCC TAGnAAAGGA CTAGCATGTC TGATTTTGAA ATCCTTATGG TTGTATTAAC 
AATCATTGGT TTAGTATTGA TTAGTACTCA AGACCATAAA AAATAaCCTT CTATTCGCTT 
TGACCGGCAT TTTTGAAGGC TATTTTTAAA TAAAATATAA GGTCACCGTC TTTTTAACGG 
GCTCATTAGG GTAACATGTT TCCGAGTCTT GCCCTTTTTC TCTTTCAAGA GTTAATGATT 
TTCATCTT-rr GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTTCrr ATGAATTCTA 
GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT ACAGCAGAAA ATACAATTAA 
AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 
AATTCACTTA GTTATnTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 
ACAGTTgGAT TACGCATCAA CTGCATaAGg CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 
AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TCTTCTATTA 
TCATTTAATG AATCTTCTGC TTTTTCTGTT TTTGTTTCAA CAGATGTTGT CACTTTATCA 
TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTr TATTCTTGTC AGCTCTTTTT 
ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATTCAACGAT ACCCATTAAA 
ATGATTGCAT GAATAACAAC GAATAAAATA ATTCTCATTA CTGTATAAAC TCTTATGATA 
TTTTCTGCAA CACTTTGAGA GAGTGATTGC GTATTAAATC AAATCAAGTA ACCAACAGGT 
GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 
GTTTTTTGAC CATTTTTCAT TGCTTTAAAT CCATTAGCTA AGTGTGTTTT TACTTTATCG 
TTAGTTGATT CaACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA 
ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATCC ITTGAGTTGT AATAGCAATA 
CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTC AATTCATAAT CATTTCTGAT 



2S 



30 



ss 
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10 



GGAAAGCCAA CATAAAATGC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTGAAAAAA 
CTATATTTAT AAATAAATTG ACGTTCATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 
AATTGATAAA 

(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



9240 
9300 
9310 



IS 



20 



2S 



30 



35 



40 



4S 



SO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 
ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 
CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAATTG AAGCTGAGTT 
ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 
AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 
TATTX3ACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 
TAAGAi\AGGA GAAGAAATTT TAAAGAAAAC AGAGTACTTA ACCTCTTTTA TACAAAGTTA 
TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 
CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA 
GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC ACTATAAAAG 
AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 
GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 
AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 
GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 
AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TCATAAACTA 
TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 
TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 
TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 
TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT GATGATAGCC TTGATATATT 
AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 
GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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w 



15 



20 



25 



30 



35 



40 



45 



SO 



GCTAGTAAAG ATGCGGGTTC 
GCTGCGATGA CAAAAGCGAT 
AGTTTGAATC AGATTTGGAC 
TATGGACAAA TTGAAGTCGC 
GATACGGCAA AAGTGTCACA 
AAAAAAAGTA ATACGTCTGC 
GAGAAGGCGA TTAAAGCTAT 
CATTTTATAG AAACTTGGCC 
TATACGAAAA TTGAAGATAA 
GCACATGTGA AAGATGGTTT 
AGTTATAGCT TCGTCGATGT 
ATTGTAATGA CATTGACTAC 
GTGATTGGTG GTGCAATTGC 
GAAACTTTAG GGAATAGTGG 
GCGGTCATAT TAATGTTTAT 
TGGAATGACA TGATTAAAAA 
TTAGCGACGA TTGGTTTAAT 
ATGGGGATGA TAGGTGAGCT 
GTTATTTTAA TCATCTTTGC 
TATATATTTA GAGTGTTGTC 
AGTATTCAAA AGTTACAATT 
ACGATTAACT GGTTGGGCTT 
ATTATX3GTAG TTGCTATCTT 
GTTTTTTTTG TTGCTATATT 
TGATTCGTAT AAGGTTAATG 
ATACTGAACT CT^GAGCAA 
GGTGGTGCAA TGATACTTGC 
TTCAATGATG TGGTATCTAT 
TGAAATAGAA GACCAAATGT 



TAAAATTAAA 
TAAAGATAAA 
AAGTAATGAA 
ATTATTACAA 
TGCTTGGACA 
AAATGATCAA 
CGACGACAAT 
GTATGTTGAA 
AATACCATAT 
AGTAGATTTA 
GATGATTATC 
CATGACGCGT 
CGGACTTGTA 
CATTCTTCGT 
CGTTGGTGTT 
TATGTATGCT 
ATCTGTGTTG 
AGCGACCAAA 
ATTATTATTT 
GATCTTTATT 
ATTAGGTGCG 
TTATCCAAGT 
AATCTTTAAA 
GGAAATATTC 
TGTTGGCGGT 
TATGAGTGGC 
TAATACAAAG 
AAGACCGAAT 
GAATAACCAA 



CTATTGCAAC 
AATAAAGCGG 
ACAGTGATTC 
CTTAGAATTG 
ACTTTTAAAT 
TACCATGTAT 
CAATTGTCGG 
GGTCAAATTC 
TATCAAAGTG 
AATAACCAAA 
TTTTTACGTG 
AATGTAAAAG 
CTGAGTATTA 
GAAAGTATGG 
TGGATGCACA 
AATGCGATTA 
CGTGAAGGTG 
GATTTTATTA 
AGATTTATAG 
TTTATTATGG 
ATGCCAAGAC 
TATGAACCAT 
TTTAAAAAAT 
GGTATTGCAG 
TTGCCTCGGC 
AATGTGAGTA 
CAAGCGAAAT 
TTTGCAAGAC 
CGTATGAGTT 



AGCAAGTCGA 
AATTAGAATC 
GCAATTATGA 
CAATTCATAA 
CAAATATTGA 
CACAATTAAA 
ATGCTGcaTg 
AAACTAAAGA 
TATTAGACGA 
TTAAAGAGGT 
AAGGGCTAGA 
ATAAGAAAGG 
TCTTAGCAAT 
AAGCGGGATT 
AACGTTCAAA 
GTAATGGTAA 
TCGAGGTTAT 
TTGGTATTGC 
TTAAATTAAT 
GATTCAAAAT 
ATGTTATTGA 
TGATAGCACA 
AAAAAACAGG 
TATAACGATA 
ATGTGAACTT 
ATATATTTAA 
AAATATATTT 
CCATAAATAA 
CAATCAAATA 



TGCTAAAGAT 
TTTGAACAAT 
CGCAAATCAA 
GTCACCATTA 
TCATGTCGAT 
TGATGCGTTA 
TGCGCTTACA 
CGGTGCTTTG 
ACATAATAAA 
TGTTGGCCAT 
AGTGTTGTTA 
GACTGCAAGT 
TACGTTTGTA 
AGGTATCGTT 
TGCAAAACGT 
TTTGGTATTG 
CATTTTCTAT 
TTTAGCTATC 
ACCTATTTTC 
GCTTGGCGTA 
AGGATTCCCA 
AGGTGCTTAT 
CCGAGTGCCT 
ATCACAGCAT 
AACGATGAAC 
TGTTAAATCG 
ACGrTAATGT 
TATTGGTAAT 
TGCTTTAAAG 



7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 
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15 



20 



25 



30 



35 



40 



45 
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GTTGAAGGCG 
AAATTAAGTG 
AAAGATAACA 
TTTGCX3GATG 
TGACAAATTA 
TGTTAGGTAT 
CTTTCAAATC 
GTAAAGTGCA 
ATTTGAAGTC 
CTGATCGTAT 
CAGTAGATAC 
TTAGTAAGTC 
ATTTACCGCA 
TTCAAGCATG 
CATTTCGAGA 
AGGAAACACC 
ATCAACTTAA 
TTGTCAGACG 
AGGCTACATT 
ATGAAATTGA 
ATACGAGGCT 
ATGGTACGGA 
AAGCGACAAA 
AATATATTAC 
GATACCTTGG 
GTGTGTAGCT 
TATAGTGTGA 
AAGCAGAAAG 
AGTGAAAGTA 



CACAAAAAAT TTATGACTTA TTTAAACCTA TTTTAGAGAA AAAAGATAAA 
ATGATATCCA AATGAACTTC GATAAAGTGA ATCAATTATT GGATAAATAT 
ACGGCGGTTA TGAGTCATTT GAAAAAGTAT CGAAGAAAGA CCGTAAAGCA 
CTGTTAATGC ATTAGGAGAG CCACTAAGTA AAATGGCTGT GATTACTGAA 
TGAACAAGTT AACGATAGTA CGCAATTTTC AAGACGTACA TTTTTGAAAA 
TGGCGGTGCC GGTGTTGCAA TTGGCX3CAAG TGGTGTTGGT AGCATGTGGT 
AATGTTCAAT ACACCAGAAG ATCCGGAAAA AGATGCGTAT GAATTTTATG 
ACCAGGCATT ACCACACCCA CGCAAAAAAC ATGCAATTTC GTTGCGTTAG 
AAAAGATAGA GATGCAATTA AGGCAATGTT TAAAAAGTGG ACGGTTATGG 
GATGGATGGT GATACAGTTG GCAAGCCX3AG TAACAATCCT TTAATGCCAC 
CGGTGAATCG ATAGGATTAG GTGCAAGCAA GTTAACGATT ACCTTTGGGA 
TTTGATGAAG AAAATTGGGT TATCTAGTAA AATTCCCGAT GCCTTTAAAG 
TTTTCCGAAT GATCAGTTAA TAGACGATTA CAGCGATGGT GATATTATGA 
CTCAAATGAT TCX3CAAGTAT CCTTTCATGC GGTTCATAAT TTAGTTCGTC 
TATTGTTAAG GTACGTTGGG CGCAATCTGG TTTTATCTCT GCTAAAGGTA 
TAGAAATTTA ATGGCATTTA AAGATGGAAC AATTAATCCT AGGAAGAATA 
AGATTATGTG TTTATTGATG ACGGATGGGC GAAACATGGA ACTTATTGCG 
TATTCAAATA CACATTGAAA CGTGGGATCG TACTGCGCTG GAAGAACAAG 
TGGTCGGAAA CX5ACATAGTG GTGCACCGTT AACAGGTGGG AAAGAGTTTG 
CTTAAAAGCG AAAGATAGTC ATGGCGAGTA TATTATTGAT AAAGATGCCC 
AGCGAAAGAA GCAAATACGT CAATTTTACG TAGAGCCTTT AATTATGTGG 
TGACCGCACA GGTAACTTCG AAACAGGCTT ACTTTTTATT GCTTTTCAAA 
ACAATTTATC GATATACAAA ATAATTTAGG TAGTAATGAT AAATTAAATG 
ACATAGAGGT TCTGCTTCAT TTTTAGTATT ACCAGGTGTT AGTAAGGGAG 
TGAAACATTA TTTGACTAAA TTTGTAGCAA TGCTAATAAC TGCTGCTATG 
TTGGGTTACT GAAAAGTCAG GCAGCAGAAC AACAAAGTAT TAGTGATGTA 
TAACGGATGC GAAATCTGCA CTTTCTAATA ATTCGATATC GAATGACAAT 
CAATTGAGCA AGTGGTAAGT GCAGTTAAGA AATTATCGCT TGAAGATAAT 
ATGCTGTCAA ATCAGATGTG AGAAAGCTTG AAGATGCAAA AGCGAATGAT 



5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 
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6900 
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7020 

7080 

7140 

7200 

7260 
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ATTTTTAAAT ATAAATTTGG AATGAATAAT AAAGTAGTCA TTAAATTAAG TTGTCTCATA 
GGAAACTTGG ACATCAATCA AAGTAATAGG CACTACAACG CTTATTCGCG GGGCCCCAAC 
AAAGAAGCTG ACGAAAAGTC agCTTGcAAT AATCTCCAAG TTGGGGATCG GCCCCAACAT 
AGAGAAATTG GGTCCGTAAT TTCTACAGAC AATCCAAGTT GGCGGGGCCC CAACATAGAG 
AATTTCGAAA AGAAATTCTA CAAGCAATGC AAGTTX3GGGA AGGACAACAA ATTTAAGATA 4080 
CAATGCGTAA CATTAATATG TTATTATAAT GATAATTTAC AGAATTATAT GAAAAATGAA 4140 
TGAGGATGTG ATGGTATGTT TGGAATGAAA GTGAATGAAC AAATAACATT AAAAATrTTA 420O 
GAAGCTCATG ACACAGAAGC GCTTTTCAAT TTAGTCAATC GTTCAAGAAA TTCACTTAGG 4260 
GAATGGTTAC CTTGGGTAGA TGCAACTGAG CAACCATCAG ATACGCGTCC ATTTATTAAA 4320 
AGAGGACTTT TGCAATTTGC TGATGGTAAT GGATTTCAGT GTGGCATTTG GTATGAAGGA 4380 
ACGCTAGTTG GTGTCATCGG TTTACATGAA ATTAATCACA TCCACAGAAA AACTTCAl^A 4440 
GGGTACTATT TAGATAAAGA ATTTGAGGGT CATGGGATTA -rcACACAAGC AGTTGAGGCA 4500 
TTGATAAAGT ATTG-rTTCGA AGAGCTTGAC TTAAACCGAA TTOAGATTAG TGCCGCAGTT 45G0 
AATAATGAAA AAAGCCGGGC TATTCCTGAA AGGCTCGGAT ITACTAGAGA AGGTATCTTA 4S20 
CGTGACAATG AATTACTAAA TGGTATTTAT TCATCGAGTT ACATCTATAG TTTATTAAAA 4680 
TCAGAATACG ACCAAAAATG ACAAATTAGA CTTACAAAAG AG-^ATGACA TTTAAAATGG 4740 
CAGCGCTCTT TTATTTAATT TTTGAAAATA AAAGGTTGTT GACAGTATTA TTTTATAACA 4800 
ATATAATGAT TTTGATAATT ATTATCAACT AGATGATCTT TATCGGAGGA TGCTTTAAAA 4860 
CAGCCGTTTT AAGTGTAATG TATTATTTTA GCGTGTAGGG AATCCGAAAA TAATATTTAT 4920 
AAGAACACAT CTATGGGGAT AATAGAATTT CTATAATCAG GTCTCAAAAT GAAAAAGTTA 4980 
ACAACGCTAT TATTAGCATC AACGTTATTA ATTGCTCCAT GTX^GGAACGA CGATAGTAAG 5040 
AAGGATGATT CAAAGACATC GAAAAAAGAT GATGGTCTTA AAGCAGAATT AAAACAAGCA 5100 
ACAAAAGCAT ATGATAAATA TACTGATGAA CAGTTAAATG AATrrTTAAA AGGTACAGAA 5160 
AAATTTGTTA AAGCGATTGA AAATAATGAT ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 
GTTCGTATGT ATTATGAACG CTCTGAACCA GTTGCAGAAG CATTTCGAGA TTTAGATCCT 
AAAATTGATG CACGTCTTGC AGATATGAAA GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 
TATCATAAGA TTGAAAAAGC ATTATACGAA GATAAGAAAA TTGATCATGT GACTAAAAAA 
GATGCACAAC AATTATTGAA AGATGCAAAA GAATTGCATG CCAAAGCTGA TACATTAGAT 
ATCACACCAA AATTAATGTT ACAAGGTTCT GTTGACCTAT TAAATCAAGT TGCAACTTCT 



5220 
5280 
5340 
5400 
5460 
5520 
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GTCATCAACG ITTTATGGCA CAGCTTGATT TTGGCGGTGT GCCATTTGAA AATGTTATGA 2040 

AGAATATTGA GTTAATTGGC AACGACATTA TACCGGCGAT TAAAAAGCAT TTATCAAAAT 2100 

AGGAGGGGCG TCATCATGAA TATTGTATTA TrGTCAGGTT CCACAGTAGG TTCTAAAACG 2160 

AGAATTGCTA TGGATGATTT AAAAAATGAA CTAGAAGTCA TCAATGAGGG ACATCAAATA 2220 

GAGTTCATGG ATTTACGAGA ACTTGAATTA GAATTTAGCG TTGGAAAGAA TTATCTAGAT 22B0 

ACTACAGGAG ATGTATATAA ATTAACGACG TCGTTAATGC AGGCTGATGT GATTTTTATT 234 0 

GGTTTTCCAA TTTTTCAAGC TTCCATCCCT GGTGCTTTGA AAAATGTGTT TGATCTACTT 2400 

CCAGTCAATG CGTTTCGTGA CAAGGTAATA GGACTTGTAG CGACAGCAGG TTCTAGTAAA 2460 

CATTATTTAA TTCCTGAAAT GCATTTAAAA CCAATATTGA GTTACATGAA AGCACATACG 2520 

ATGCAAACX3T ATGTATTTAT TGAAGAGAAA GATTTTTCAA ATCAACAAAT TGTCAATGAT 2580 

GATGTTGTAT TTCXKSTTAAA AGCGTTGGCA CAATCCACAA TGCGAACTGC CAAAGTACAA 264 0 

CAACAAGTGT TTGAAGAAGA AAACAACCAA TACGACTTTT AAAGTATAAA AATAAGACGC 2700 

TCGGCACACT AAATTTGTAA GTGTTTGAGC GTCTTTTCAT ATTAACTATA TAGCCAATGA 27 60 

ACGACGATAA AGGCAAGTGA TGACAAGCAT ATTGAGGTAA TAATGATTGT CATAAGCGGT 2820 

TTAAGTGCGC GATTTTTAAG ATCTTTAAAT GCAACATTTA ACCCTAAAGC AACCATGGCC 2 880 

ATTAATAAGC AAATTCTTGA TACAGTATTT AAAATATTTA GCAATGCTGA CGGAATAGTT 294 0 

ACATATGTAT TCACTAAGGC CATAATGACA AATCCAATTA AAAAGTATGG AATGCTTATT 3 000 

CGACCCTTGC TAGATGATTC TGATGAACGG AAACGCATAA TTAAAATAAG TACGATGGTT 3060 

AATGGAATCA GTAAGAATAC TCTACCAAGT TTACCAAGAA GTGCAATTTT AAGTGCATCA 3120 

^ CTACCACCAA AGCCACCAGC TAAGACAACG TGTGCAATTT CATGAAGACT AACACCAGAC 3180 

Ca^CGCCAT AAACATTTGT CGTCATTGAA AAGATAGCGT AGATAGCTGT ATATATAAGT 3240 
GAAAATATCG TACCAATCAA TGCGATGATA CCGATACTAA TAGCTGTATC CTTTTCACGT 3300 
^° GATTTGAATA TTGGAGCGAC TGCGGCAATA GCAGCAGCAC CACAAACGCC TGTGCCGACA 3360 

CCTAGTAATA ATCCGATGTT TTTGTCACCA TGCAACAGTT TGTTGACAAA GAGCATCATT 3420 
ACAATACTGA AAATAACGAC ACCTACATCG ATGGCTAATA GTTTACTACC TTGACCGATA 3480 
ATATCGAATA TATTCAGTTT AAGTCCATAT AGGATGATTG CAAATCTTAA TAAATATTTA 3540 
GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAAAGTG ACGATATAGA 3600 
ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTAT CTAGGATTGG CAATTTAGCT 3660 
GCTAAAAAGC TAAATAATGC GACTATAAAT GTTAGTGATA GCCCAATCAT AAAATGCTTA 3720 



10 



1S 



20 



SS 



30 



45 



SO 



SS 



1171 



"t>OCID: <EP 078651 9A2 I > 



EP 0 786 519 A2 

TTGGATTACG ATTAC3TTGAA AAGTCGGTTA ATCAAGACAA TCCTTCAAK3 TATCATTTGT 
TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATTTTAAG CTTTTTTGAA ATTCCCAATG 
5 CGGGTCATAA GCAGCCAGGT ACTGAAACGA TTTATCGATT TTCATTATTA GTACCAAATC 

AAGCGGCACT TCATTATTTT GAAAAACGTC TTGAGAATAA TGGTATTAAG TCTGAACGTT 
TGTACTATCT TGGACAAGAA GGTGTTCTCT TTAAAGATGA AGACGACTTA GAAATCATAT 
TGCTTGTTAA TGATAGTTTT GAAGTACCAC ATCAATGGCA ACATAACGCT TATAGTGAAA 
TACCTCAAGC ATATCAAATT TTAGGAATAG GGCCAGTCGA ATTAAGAGTT AGAAATGCAG 
CGCGTACGGT AGAATTTTTG GAAAATGTCT TAGGTTATCG CAAAAGAGAT AATAAATCAT 
TCGATGTGCT GACATTAGCA CCACAAGGTT TATATTCGGA TrrTCTAGTT " ATTGAGCAAC 
AGGGACAACG TGAAAGACCT GGACGAGGTT ATATCCATCA TATTCCAGTT AATACACCAC 
AAATGAGTGA CTTAGATGCA ATTTACAAGA AATTACAACA ACAACCACAA AGTAATTCAG 
GTATAA-rfGA TCGCTATTTC TTTAAATCAT TATACTATCG CCATAATTCA ATTATGTATG 
AATTTGCGAC TGAAGCGCCT GGATTTACTA TTGATACACC TCTTCAACAA TTAGGAAGTC 
25 AATTGAACTT GCCTCACTTT TTAGAAGCAG AACGTGAACA AA-TTGAAAGT AAGTTACACG 

AAATATAAAG GAGAATGTrr AATGGCCAAA TTAGAAATGA ATAAAAATAC GCCTCTTGAG 
TTTGGTTTGT ATTCCTTAGG TGATCATTTA TTGAATCCAT TGAAAGGTGA AAAAGTTAGT 
TATGAGCAAC GTATTAATGA AATTATTGAA GCAAGTAAAT TAGCAGATGA AGCAGGTATT 
GATCTTTTTG CAGTTGGTGA AAGTCATCAG GAGCArTTTA CAACACAGGC ACATACGGTT 
GTGTTAGGTG CAATTGCCCA AGCGACAAAG CATAITAAAG TTTCAAGITC TTCAACGATT 
ATTAGTGCAC AGATCCTGTA AGAGTATTTG AAGACTTCGC GACATTAGAT TTGATTrCTC 
ATGGTAGAGC CGAAATTGTA GCTGGCAGAG CATCAAGAAC AGGTATTTTT GACTTGTTTG 
GCTATGATTT AAAAGACTAT GATGAATTGT TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 
TAAATAAAAC TGAGCGTATT ACTTGGTCTG GAAAATATCG TCCAGAACTT AGAAATATGA 
AAATATTGCC AAGACCAATC GATAATATAT TGCCAATATG GCGTGCTGTT GGTGGTCCAC 
^5 CTGCAAGTGC TATTAAAGCG GGAAAACAAG GTGTGCCAAT GATGATTACA ACCCTTCGTG 

GCCCAGCAAT GAACTTTAAA GGTTCTATAG ATGCTTATCG TCAAGCGGCA ACTGAAGCAG 
GTTTCGATgc TTCGCCTAAG TCTTTACCAG TAAGTACAGC GAGTCTGTTT TATACAGCTG 
SO AAACAACTCA GGATGCTATG AGAGAATTTT ATCCACAITT GAATACAGGO ATGTCATTTA 

TTCGTGGTGT TGGTTATCCG AAACAGCAAT TTGCTAATTC GTCAGATTAT CGAGAAGCGC 



30 



35 



55 



MSCOCID: <eP 07e6519A2_l_> 



1170 



EP 0 786 519 A2 





TTGAAATTTT 


CTTCATTTCT 


TTCGCATTTT 


TCGATTCATC 


TTTCTTGAAC 


TCATTGTTAT 


2940 




ATGGTTCGTC 


ATTTCCAAGA 


ATCATCAACT 


TCATACCATG 


TCGTTTATAG 


TTCGCATTTG 


3000 


£ 


TAATTTGAGC 


TTTACGAGCA 


GGTATTAATG 


GTTTGTCTGA 


TACATCTTTA 


AACATATTTT 


3060 




CTTCACTAGT 


GAAGTTTGGA 


TCTTTCAATT 


CGAAATTGAA 


ACGTTTATAT 


GCATAGAAGA 






TGTATTCTGG 


ATCGGCTGCT 


GGATCAACAA 


ACGCCATATG 


TGTACCATTA 


AATTCTAAAG 


^ 1 o n 

31oD 


10 


CTTTATTAGG 


TGTGCTTGGT 


AATAATTGTT 


TATCAAATGT 


ATCTTGATCT 


AACGTTTTCT 


3240 




TACCTTTAAC 


TTCTTTCACC 


CATTTGTCGT 


AGTCTTTTTG 


ACTAACGGCA 


TTTACTTTAA 


3300 




ATGTTTGACG 


TGAGAATCCT 


TCACCATTGA 


AGTTAGAGTT 


ACGACCTCTG 


AACGTACCAG 


3360 


IS 


TTTGAGATGC 


TTCTAACGTC 


CAATTCATTG 


TCATGCCAGT 


CATGGCATAT 


TTTTGACCAC 


3420 




CTAATTGTGG 


AATCCAGAAA 


CTTGTCATTG 


TATCCATAGC 


TTGAAGCTTA 


AATACAACAG 


3480 


20 


GACGATCTTT 


AGGGATTGTT 


AATGTATTAA 


CAGTCTCTAT 


ATGTTCATCT 


GGATAAGCAA 


3540 


AGAACCATTT 


GTATCCTGCA 


CTTACTGCAT 


ATACAACCAT 


TGGATCTTTC 


TCACTCTTCG 


3600 




GTGGTTTTTC 


GTAATCGTAT 


AAAGTTTTAA 


CTGTAGGAAT 


AGctAAAGCA 


GCAACGATTA 


3660 


25 


TGATAGGTAT 


TACAAACCAT 


ATTGTTTCAA 


TGATGGCATT 


ATGGTGCATC 


TTACCAGATT 


3720 




CX3GCATTCTT 


ATTATAACTA 


TACTTGTAAA 


TAAAAATGGC 


GAACATGCCA 


AGTACAACGA 


3780 




AACAAATAAC 


AAGCATGAAG 


ACGATTGAAT 


AAAGAATCAA 


GAACTTCTGA 


CTACTTGCTA 


3840 


30 


CTGGCCCTTT 


TGCGTTGAAA 


ATTTCTATAT 


TTGAACAACC 


ACTAAGTAAA 


ATTAGTGTGC 


o o r\ A 

3 90 .0 




CAAATAATAG 


AAGCAAAGAC 


TTAAATTTTG 


ACACrriTTT 


GACCTCCTAA 


TACTACAAAT 


3960 




GTAGGGCTTA 


ACATTAATTT 


TAAGTTATTA 


CACAATATTT 


ACAAGGGCTT 


ATGGGAAAAA 


4020 


35 


AATTAATAAA 


ATTGTATCAA 


AAATGTTGAT 


AAATCAAGGT 


GTGACGTGGG 


TTCACACATT 


4080 




TGTTAAAATT 


ATGTGTACAT 


TTTGTGACTA 


ATAGCGTTTT 


T 




4121 



(2) INFORMATION FOR SEQ ID NO: 316: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOUXSY : linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 
CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 
ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 
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TTGTTTGGCA TATGAATGTC TTTGTAATTA TGGTTGTCTA AGTAATGACG ACCATGTTCT 


1140 


5 


TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 


1200 




GGTGGAATTG CTGATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATCT 


1260 




CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 


1320 


10 


AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGITTAATAA GAACCAACCA 


1380 




TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT 


1440 




GGTAAGAAAC AAACGTTAAA TCCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTr 


1500 


15 


TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 


1560 




AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 


1620 




TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 


1680 


20 


TTAGGGATAA ATGCTAATGA GAATAGCATT GGTGACTCAA ATGTAATACG TCCTTTATAT 


1740 




AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 


1800 


25 


ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACCCAA 


1860 


ACTAAGAAAC TAAGGAACGC GATACCGGCA GTTGCCCATA CCATACl-TTG ATGTCCGAAT 


1920 




AAACGCTTAC GAGCGAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 


1980 . 


30 


ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAGCCCAAAG CATTGGCATA 


2040 




CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 


2100 




AACGCTACTG TTAAAGGAGG GAAAGCAAGA ATAACAATTA ATGTAGTAAT AAATGTTGTT 


2160 


35 


ACTGTAAACA TTGGCATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 


2220 




ACAAAGAAGT TGATACCTGT AGCTAAGGTA CCAAGCCCTG AAATTTGTAT AGCTATTAAG 


2280 




TAATAGTTAA CACCCXX3ACC AGGACTGAAT TCACCTGCTA GTGGCGCATA GTTTGTCCAA 


2340 


40 


CCAGCTGCTG GTGAACCACC AATAATAAAT GACAGGTTCA ATAAAATCAT ACCTCCAAAG 


2400 




AATAGCCAGA AACTTACGTT GTTTAATACT GGGAATCCAA CATCACGTGC TCCAATTTCT 


2460 


45 


AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 


2520 


ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTGGTTAG ATTCTAAAAA TrTGTTATCA 


2580 




GGTACTGTTA ATTGCGCACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 


2640 


SO 


ACGGCACAGA TTAAATACAT AATACCGATT TTCTTATGGT CTACAGATGT GAACCATTCT 


2700 




TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACIG CGATTAAACC AATAACTAAG 


2760 




AATGGGGCAC CAATTTGTGC CATTGTAATC ATcCAGTTAC CrTTAACTAG TAATTGATCC 


2820 



55 
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10 



IS 



20 



30 



35 



40 



SO 



TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG 420 

AGACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 480 

GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAACGAGAG 54 0 

AAAGATTACA AAGCTTATTT GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACX5TTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 

ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 
(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 60 

^5 AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 120 

ACGAATGCAA AGCCAAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180 

GTTACGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 240 

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 300 

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 360 

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 

GCGATACGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACCAG TAAGAACCAA 480 

TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 540 

AGACTAAACC TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 600 

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 660 

TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720 

CATGrroCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780 

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 840 

TTGTATCATG ACfCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 

GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 960 

CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACAGTTTCG 1020 
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GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTCTTGCAIT 
GCAATTTGTG CTGTAGTTGG TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 
TCACCAATAA CAAAAATGTT GTCGTAACCA TTGATTCTTA AATeTTGCrr TGTAACGATA 
CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 
GCCCATACTG AAGTACCTGC ATTTAATTGT TCTTTTTCAC CATCTACTTC AACTACAAAA 
CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 
AAGTAGCTAA CTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 
GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATA-rTTGCT ACATAATTCA 
GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 
AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCrc CATAGTTAGC AAATTTGTCT 
20 TCGATATGAC GTGATAATTC ACGTGCTGTC ATAACATTTT CAATTTGGAA AGCATGATCT 

TTCATACCTT CGATGCCGAA TGTTTCACTA ACGAAACCTA ATGCTACTAC TAAAATATCA 
AAGTCATAAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 
TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 
TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATCTTCCTTC ATAGTGATAT 
TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 

30 

GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTrT ACGATCTTGA 
GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 
(2) INFORMATION FOR SEQ ID NO: 314: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: linear 



45 



SO 



SS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 60 
AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 
ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 
GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATCCCACAT 
GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GGATTATTCA CCGTGAACCG 
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AATTAAATTA TTGTTGTrTT AACCATTCTA CTATCC3TCGG PATCKTM^T CCTGTTGGCC 360 

CTTTTGGACC ATTATATCAA GCTTTATTAG TCGTTGCTGG ACCTGCAATA TCGAAATGAA 420 

TGTGAGGTGT TTGACCACTA AAATGTCTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 480 

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTGTG TTTAATACTT GCACGTTCGG 540 

TTGCAGTAAT CGGTAATTCA AATACCATTT CATCGACTTC AGAACTTATT TGTAATATAT 600 

CGTITAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660 

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720 

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTTCTA 780 

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 84 0 

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900 

CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCCATGC 960 

CATTCTTCGT TTTAATACTA TAACCACCAG AATCATACGT TATACCTTTA CCAACTAAGG 1020 

CAATTGGTGC TTCATCTTTG TCTTTGCCAT TATATGTGAT GGTTACTAAt CTCKGTTTAT 1080 

25 GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 1140 

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATTAACAATA TCTTCTGcAA 12 00 

ATGTTTGTGG TGtTAATACA TTCGGTGGcA TATTACTAAA GTCTCyTGcC AAATTAATGG 1260 

^ aTTGGcCAAT ACTGATACCC TCATCAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320 

GGkTTAAATT CGTCTTAAAC GGtGCCTTCT kACTTgrTTT ATAATGATCG aATyCATATG 13 80 

kTGcACGCTC ACTTTGGAAT ACCGCATGCC ATTAATACAT CTGgATAACT GgATCcATAT 1440 

TTTGgAnATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500 

TTGGAAAAGA TCTCCCCaTA TTTTCAACAT ATCTTGATAA tACGTGTCTT TAAGTTTCCT 1560 

AAACCAACTG TAATTAAACG ATATGTTTGA TCTTGTACAT CAAATGCAGT TGTATAAATC 1620 

TTCCCAACTT TACTACCAAT AATATCTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 1680 

TCGATATGAT TAAAACTAAT GCGCTCTAAC TGATTTAAAT GTTCTGGTAT ACCAATAATC 1740 
^5 AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AATTCATAAT GTACCTCCTT 
AGATTTGATA TGTTACTCAC TTCAAATPST ACAACAATAA AGCCCTcAGT GACACTGAAG 

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGCTG AAACTTTATT TAAAGCCTAG 1920 
^° AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 1980 
ATCACTTTTT TCATGAATCC TGCTTTTTTA CCAGCGATAG GTTTACCAAA TACCATACCT 
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t CTTCG ACTV; 


v»^-AL. XXjCTCC 


CTCAGGAGTC 


TCGCCATTAA 


TACTACGTAT 


TAACATGTAA 


€900 


5 


TTTTACTITT 




aaaaaataaa 


ACACTTTGCC 


CAACTTGCAC 


ATAAATGTAA 


6960 




AATTCAATAA 


AMI OAA 1 ill 


CTGTGTTGGG 


TCCCTTCGTA 


TAATTTAATA 


AATACCACTA 


7020 




AACTAAATTA 




ttatgtataa 


AATTTATAAC 


ATGACCCAAC 


TTACACTACC 


7080 


10 


AATAGAAAr*P 




TTCCTCAAAA TGATATTTCG 


CGATATGTTA 


ATGAAATTGT 


7140 




TGAAACGATA 


C PTY^ A T A nr^r* 


AATTCGATGA 


ATTCAGACAT 


CATCGTGGCG 


CAACATCCTA 


7200 




TCATCCAAAA 


ATGAT^^TTA & 
A ^J/^ 1 X 1 AA 


AAAT CATCTT 


ATATGCATAT 


ACTCAATCTG 


TATTTTCTGG 


7260 


IS 


TCGTAGAATA 


u>\x/\/\Al lAk» 


TTCATGACAG 


TATTCGAATG 


ATGTGGTTAG 


CTCAAGATCA 


7320 




AACACfT* I T"P 


1 A 1 AAAACTA 


TTAATCGTTT 


TAGAGTGAAT 


CCTAATACTG 


ATGCGTTAAT 


7380 




TGAATCTPTTA 


i> X 1 A 1 1 L-AGT 


TCCATAGTCA ATGTTTAAAG 


CAAAATCTTA 


TTGATAATAA 


7440 


20 


X X \_^rv\ X X X X J. 


Al IVjATGGTA 


caaaagtaga 


AGCTAATGCC 


AATAGATATA 


CATTTGTGTG 


7500 




GAA(!'«AA A Arj T 


Al iUAAAATC 


ACGAATCGAA 


ATTGAACGAA 


AATTCAAAAA 


CATTATATCG 


7560 


25 


X VJJ~k\v X X X X 


l^AAijAAAAAA 


TAATACCAGA 


GATAAAAGAA 


GATGGAGATA 


GCGATTTAAC 


7620 


AATAGAAG& A 


A 1 Avjf ATTT AA 


ttggtagtca 


TTTAGATAAA 


GAAATCGAAG 


ATTTAAATCA 


7680 




X X V. X X X \3e\\3 


& R O/*' TV TV O ■JV rpiTi 

AAULjAAGaTT 


GTGCTCAAAT 


TAGAAAACAG 


ACCCGTAAAA 


AAATAACTGA 


774 0 


30 


G ATT'A A/^ A A n 


rrCAAAAAGA 


AATTTGATGA 


TTATTCCGAA 


AGAAAAAATA AATATGAAGA 


7800 




ACAAAAATCG 


ATTCTTAAAG 


ATAGAAATAG 


TTTTTCTAAA 


ACTGATCTGA 


TCATGATGCA 


7860 




ACTTTTATGA 


GAATGAAGGA 


AGACCATATG 


AAAAATGGCC 


AACTTAAGCC 


AGGATACAAT 


7920 


35 


TTACA7UVTAG 


CGACAAATTC 


TCAAAAATGT 


TTTATCCTAT 


GACCTATTTC 


AA 


7972 




(2) INFORMATION FOR SEQ ID NO: 313: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3175 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



45 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 
ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 
CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GTCAAGrTCA TTTATGCTCA 
CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 



lNSCK»aO: <ep 07a6519A2_l_> 



1164 



10 



IS 



EP0 786 519 A2 

TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 

AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 5280 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 534 0 

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 5400 

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATCAGACAA TGTAACAAAT TAGGTAAACC 546 0 

AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACX5TG CTACACGTGC 5520 

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 5580 

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 5640 

20 ATCAGCTGAA GCAGCCCAAG ATTACAAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5700 

AACTTCATTA GTGAATGCTA TCGGTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 5760 

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820 

TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC 7VATGTTCAAT 5880 

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5940 

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6000 

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120 

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATCGTTAC 6180 

TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 6240 

AGA;^TGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300 

TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360 

TCCTCAAGGT AAAATCTTTG AAGGATATGC AAACGTACTA TAATTTATAA AAAAACGTCT 6420 

TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 64 80 

AATAGGTTAT TTGATAATGA TAGTGTATCA ATGGCAATCT ATATAAATGT TTATATCTTT 6540 

TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600 
AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 
nTTGAGCCA AGTACGCAAT TTAATATTAT AAGTTGCATA TAAAACAGGA TGGGACATAA 6720 
ATCCCTAAAA AAACAGCAGT AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 6780 
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GGTGCACATA AAGATATTGA ACAGCAAGCT TTAGCTATTA AATCAGCGTT TGTTGCACAG " 330, 

TTAGATTCAC TTGAGTCATT ATCACGTGAT GAAATTGCTA ATCATCGCTT TGAAAAATTC 336, 

AGAAATATCG GTTCTTATAT AGAATAATCA ACTTGAGCAT TrrTATGTTA AATCGATACT 342, 

GGGTTTTACC ATAAATTGAA GTACATTAAA ACAATAATTT AATATTTAGA TACTGAATTT 348( 

TAACTAAGAT TAGTAGTCAA AATTGTGGCT ACTAATCTTT TTTTAATTAA GTTAAAATAA 354( 

AATTCAATAT TTAAAACGTT TACATCAATT CAATACATTA GTTITCATGG AATGACATAT 3 60C 

CAATTTGTGG TAATTTAGAG TTAAAGATAA ATCAGTTATA GAAAGGTATG TCGTCATCAA 3660 

,^ GAAAATTGCA GTTTTAACTA GTGGTGGAGA TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3720 

AGTTGTTCGT ACAGCAATTT ACAATGAAAT TGAAGTTTAT GGTCTCTATC ATGGTTACCA 3780 

AGGATTGTTA AATGATGATA TTCATAAACT TGAATTAGGA TCAGTTCGGG ATACGATTCA 3840 

GCGTGGAGGT ACATTCTTGT ATTCAGCAAG ATGTCCAGAG TTTAAGGAGC AAGAAGTACG 3900 

TAAAGTTGCA ATCGAAAACT TACGTAAAAG AGGGATTGAG GGCCTTGTAG TTATTCGTGG 3 960 

TGACGGTAGT TATCGCGGTG CACAACGCAT CAGTGAGGAA TGTAAAGAAA TTCAAACTAT 4020 

CGGTATTCCT GCTACGATTG ACAATGATAT CAATCGTACT GATTTTACAA TTGGATrTCA 4 080 

CACAGCATTA AATACGATTA TTGGCTTAGT CGACAAAATT AGAGATACTG CGTCAAGTCA 4140 

CGCACGAACA TTTATCATTG AAGCAATGGG CCGTGATTGT GGAGATCTAG CATTATCGGC 4200 

TGGATTATCA GTTGGTGCTG AGACAATTGT AGTTCCAGAA GTGAAAACAG ATATTAAAGA 4260 

AATAGCTGAT AAAATTGAAC AAGGTATTAA ACGTGGTAAG AAACACTCAA TCGTrCTTCT 4320 

AGCAGAAGGT TGTATGACTG CGCAAGATTG TCAAAAAGAA TTATCACAAT ACATCAATGT 4380 

TGATAATAGA GTGTCTGTGT TAGGTCACGT TCAACGTGGT GGTAGCCCAA CAGGTGCGGA 4440 

TAG;^iTTTTA GCATCACX3TT TAGGTGGATA TGCGGTAGAC TTATTAATGC aAGGTCAAAC 4S00 

40 AGCTAAGGGT GTTGGAATTA AGAACAATAA AATTGTAGCA ACATCiriTG ATCAAATTTT 4560 

TGATGGTAAA GATCATAAAT TTGATTATAG TCTATATGAA CTrcCTAACA AGTTATCTAT 4620 

ATAAGATTTC AGGAGGAATT ATAAAATGAG AAAAACTAAA AITCTATGTA CAATTCGACC 4680 

AGCTTCAGAA TCAGAAGAAA TGATTGAGAA ATTAATCAAT GCTCGTATGA ACGTTCCACG 4740 

ATTAAACTTT TCACATGGTA GTCATGAAGA GCATAAAGGT AGAATTCATA CAATTCGTAA 4800 

AGTAGCTAAA AGATTAGACA AAATTGtAGC AATTTTATTA GATACAAAAG GTCCAGAAAT 4860 

TCGTACGCAT AATATGAAAG ACGGTATCAT TGAACTTCAA CGTGGCAACG AAGTTATTCT 4920 

TAGCATGAAT GAAGTTGAAG GAACACCTGA AAAGTTCTCA GTAACATATG aAAACTTAAT 4980 



55 



NSOOOD: <EP 07865 19A2_I_> 



1162 



10 



15 



20 



25 



EP 0 786 519 A2 

ATTAAAATGA TGAAAATGAA AGTTTATGAT AAACATTCAA CAGTCAAACG AATATAAATC 1500 

AAATAAATTT AAACCCGTTT TTAACTGGTC AAGTTCAGTT TAAGGCTCTA AATGGTTAGA 1560 

ACAGAGGTTA TTTGGAGGTT TTCCTATGTT TAAAGATTTT TTTAATCGAA CAAAGAAAAA 1620 

GAAATATCTT ACAGTACAAG ACTCTAAAAA TAATGATGTG CCTGCAGGTA TTATGACTAA 1680 

GTGTCCAAAG TGTAAGAAAA TTATGTACAC AAAAGAATTA GCTGAAAATT TAAATGTGTG 1740 

CTTTAATTGT GATCATCATA TTGCTTTAAC TGCGTATAAA CGTATAGAAG CAATTTCTGA 1800 

TGAAGGATCA TTTACAGAAT TCGATAAGGG AATGACCTCT GCGAATCCAT TAGATTTTCC 1860 

aAGTTATTTA GAAAAAATTC AAAAGGACCA ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920 

GACTGGTACA GCACAACTAG ATCGTATGAA ATTTGGCGTT GCTGTCATGG ATTCACGTTT 1980 

TAGAATGGGA AGTATGGGAT CGGTTATCGG TGAAAAGATA TGTCGCATCA TTGATTACTG 204 0 

CACTGAGAAC CGTTTACCAT TTATTCTTTT CTCTGCAAGT GGTGGTGCAC GTATGCAAGA 2100 

AGGTATTATT TCCTTGATGC AAATGGGTAA AACCAGTGTA TCTTTAAAAC GTCATTCTGA 2160 

CGCTGGACTA TTATATATAT CATATTTAAC ACATCCAACT ACTGGTGGTG TATCTGCAAG 2220 

TTTTCCATCA GTTGGTCATA TAAATTTAAG TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 2280 

TCXSTCGAGTT ATTGAACAGA CAATAAACGA AAAATTGCCA GATGATTTCC AAACTGCAGA 2340 

ATTTTTATTA GAGCATGGAC AATTGGATAA AGTTGTACAT CGTAATGATA TGCGTCAAAC 2400 

ATTGTCTGAA ATTCTAAAAA TCCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 2460 

CCACTTTTTG AAATTCGAAA TAAAATTGAA TCTTTAAAAG AATCTCAAGA TAAAAATGAT 2520 

GTCGATTTAC AAGAAGAAAT TGACATGCTT GAAGCGTcAT TGGAAOGAGA AACTAAAAAA 2580 

ATATATACAA ATCTAAAACC ATGGGATCGT GTGCAAATTG CGCGTTTCCA AGAAAGACCT 2640 

ACGACCCTAG ATTaTATTCC ATATATCTTT GATTCGTTTA TGGAACTACA TGGTGATCGT 2700 

40 AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TCGTGCTGTT 2760 

ACAGTTaTTG GACAACAACG TGGAAAAGAT ACAAAAGATA ATATTTATCG AAATTTTGGT 2820 

ATGGCGCATC CAGAAGGTTA TCGAAAAGCA TTACXSTTTAA TGAAACAAGC TGAAAAATTC 2880 

AATCGTCCTA TCTTTACATT TATAOATACA AAAGGTGCAT ATCCTGGTAA AGCTGCTGAA 2940 

GAACGTGGAC AAAGTGAATC TATCGCAACA AATTTGATTG AGATGGCTTC ATTAAAAGTA 3000 

CCAGTTATTG CGATTGTCAT TGGTGAAGGT GGCAGTGGAG GTGCTCTAGG TATTGGTATT 3 060 

GCCAATAAAG TATTGATGTT AGAGAATAGT ACTTACTCTG TTATATCTCC TGAAGGTGCA 3120 

GCGGCATTAT TATGGAAAGA CAGTAATTTG GCTAAAATTG CAGCTGAAAC AATGAAAATT 3180 
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3X2: 
TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 
75 TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 

GTTATTTTTT AAAAATAAAT ATTGAATTnG GGGCTGnTTT CATGTCATTA AGAGATGAAG 
CATTGGAAAT GCACAAACGT AATCAAGGTA AATTAGAAGT TAAACCAAAT GTAAAAGTTA 



60 
120 
180 
240 

CTAATAAAGA GGAATTAAGT TTAGCATAPT* n'h.nn'vnnnrri.T 

-w^ww* A^rw»^*»o ivji/\rt/\vjMi/\ JOO 

TTTATGAAGA TMAAGAPAA GTATATGATT ACACAATTAA AGGAAATACA GTTCCAGTTA 360 

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG ACCTGAAGCA AGTATTCCTG 420 

TAATGGAAGG TAAAGCAGTA TTATTCAAAA GCTTCGCTGG TATCAATGGG GTGCCTATTG 4 80 

CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT 54 0 

ATCGTGGTAT TAATTTAGAG GATATTTCGG (:».CCACGTtG TTTTGAAATT GAAGAACGAT 6 00 

TGAAAAAAGA AACTAATATT CCGGTATTCC ATGACGATCA ACATGGTACA GCAATTGTAA 660 

CATTGGCAGG TTTGGTAAAT GCATTGAGAG TTGTTAACAA AGATATTGCT AAAATAAAAG 720 

35 TTGTACTAAA TGGTGCTGGT GCAGCAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG 780 

GTGTAAGAAA TATGGTTATG TGTGACTCAA GAGGCGCAAT TTTTGAAGGA CGTTCATATG 84 0 

GTATCAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT 900 

'>° CTTTAGAAGA AGTCGTAAAA GACGCAGATG TATTTATCGG GGTTTCTCTA GCTAATCCGC 960 

TGTCACAAGA TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 1020 

CAAATCCTGA AATAATACCT GATGATGCCA AAGCGGCAGG TCCACGAGTT GTTGGTACAG 1080 

4S 

GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTCCCTGGT ATTTTTAGAG 114 0 

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 1200 

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260 

TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAGCGGCA ATGGAATCTG 1320 

GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 13 80 
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CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 
AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 
iO STRANDEDNESS : double 
(D) TOPOLOGY : 1 inear 
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(xi) SEQOENCE DESCRIPTION: SEQ ID NO: 311: 
CAACATTAAT ATTGATATTA AATCTTCCTG GATAACGTGC TTGTTCGAGT GATAAGTATG 
CACGCACTTG ACTTAACTCT TTATCTAAAG TAATCGTATG TTGCTTAGAG CCTTGTAAAT 
TCGCTCTGAA AAAATAACTC AATTCTAATA GTAACTCTCG TGCCTTTTCG CTATTTATTC 
TAACTAAAGC TGAGATCGTG TTAATTGAAT TGAAGAAAAA ATGTGGACTC ACTTGTGCCT 
GTAATGACTT AATCTCAGCA TCTTTCAATA ACTTACTTTG CGTTTCGGCT TCACCAAGTT 
CAATTTGGCT ACTAAAAATA TTTGCCAATC CTTCTGCAAG TTGACGTTCC ACAAAAGTTA 
AATCATTAGG GTTTGTAAAA TACATCTTCA ATGTACCGAC GATAGAACCA TGCATCTCAA 
GTGGTATCAC GATAGCTGCT CTAAGCGGGC AATTCGGATG ACTACAACCA ATCTCTTCTT 
TAGTATGAAC TTCTTTCAAC TTTCCTGATT TCAATACATC TTTAGACAGA CTTGTTAATA 
TTTCATTTGT TGGTATGTGA TGATCACTAC CTGCACCTAC ATGCGATAAG ATTTCATTTT 
TGCTTGTAAT TGCTACGGCA GATACTTTCA TTAAATTTTT AATAATCATC GCAATTTGCT 
GTGCCGATTC TCTATTCAAT CCTTCTTTAA AATACGGCAA TGTCTGGTTC ATCAATTGCA 
GTACJVTCATG TGTTTGAACA GCCTTCATTT GCTCCTCTTG CTTTAATGTT GAAATGATAA 
TAGACATAAA AATCGCCGTA CCAACGCTAT TAACAATAAT CATTGGTAGT GCAATTAATG 
ATATGAGGTC AACCGCATAT GCTTTGTCGT GGGAAAATGT TAAAATGCTC AACATTTGAA 
TCATTTCCAT AACAATTCCA ATCATGGCAC TTTTCGCAAT ACTCGGGTAA CGCTTGCGTC 
TTTGAGCTTG TAAGCCAAAA TAACCAGCAA TTATACCAAT AAATATAGAT GAGATAAGAT 
AAACTTGTGC ATCCGCCCCA CCCATATACA CTCTGAAAAT ACCTGAAATA ACGCCAACAA 
ATAGACCTAC AAAAGGGCCA CCAACTAATC CTGCGACACC TATCGTTAAT ACACGTGTGT 
TAGCTAAAGA TACATCATCA TCTAAACGGA AGTACACACT TCCTGACAAA CTATGTTGAT 
GATCGATGAC GATACCAGTT AAATTAGACA TTAAGGCAAA CAAACTGAAA ATAATACATA 



60 
120 
180 
240 
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420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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AGGCPiACGAC CAATATGAGT CAACCTGCAC AACCAAAATC GTCAACGAAT GCAGATCAAG 1032 0 

CGTCAAGCCA ACCAGCTCAT GAAACAAATT CTAATGGTAA TACTAACGAT AAAACGAATG 10380 

AGTCAAGTAA TCAGTCGGAT GTTAATCAAC AGTATCCACC AGCAGATGAA TCACTACAAG 10440 

ATGCAATTAA AAACCCGGCT ATCATCGATA AaGAACATAC AGCTGATAAT TGGCGACCAA 10500 

TTGATTTTCA AATGAAAAAT GATAAAGGTG AAAGACAGTT CTATCATTAT GCTAGTACTG 10560 

TTGAACCAGC AACTGTCATT TTTACAAAAA CAGGACCAAT AATTGAATTA GGTTTAAAGA 10620 

CAGCTTCAAC ATGGAAGAAA TTTGAAGTTT ATGAAGGTGA CAAAAAGTTA CCAGTCGAAT 1068 0 

TAGTATCATA TGATTCTGAT AAAGATTATG CCTATATTCG TTTCCCAGTA TCTAATGGTA 10740 

CGAGAGAAGT TAAAATTGTG TCATCTATTG AATATGGTGA GAACATCCAT GAAGACTATG 10800 

ATTATACGCT AATGGTCTTT GCACAGCCTA TTACTAATAA CCCAGACGAC TATGTGGATG 10860 
20 AAGAAACATA CAATTTACAA AAATTATTAG CTCCGTATCA CAAAGCTAAA ACGTTAGAAA 
GACAAGTTTA TGAATTAGAA AAATTACAAG AGAAATTGCC AGAAAAATAT AAGGCGGAAT 
ATAAAAAGAA ATTAGATCAA ACTAGAGTAG AGTTAGCTGA TCAAGTTAAA TCAGCAGTGA 
CGGAATTTGA AAATGTtACA CCTACAAATG ATCAATTAAC AGATTTACAA GAAGCGCATT 
TTGTTGTTTT TGAAAGTGAA GAAAATAGTG AGTCAGTTAT GGACGGCTTT GTTGAACATC 
CATTCTATAC AGCAACTTTA AATGGTCAAA AATATGTAGT GATGAAAACA AAGGATGACA 
GTTACTGGAA AGATTTAATT GTAGAAGGTA AACGTGTCAC TACTGTTTCT AAAGATCCTA 
AAAATAATTC TAGAACGCTG ATTTTCCCAT ATATACCTGA CAAAGCAGTT TACAATGCGA 

TTGTTAAAGT CGTTGTGGCA AACATTGGTT ATGAAGGTGA ATATCATGTC AGAATTATAA 11400 

ATCAGGATAT CAATACAAAA GATGATGATA CATCACAAAA TAACACGAGT GAACCGCTAA 11460 

ATGTACAAAC AGGACAAGAA GGTAAGGTTG CTGATACAGA TGTAGCTGAA AATAGCAGCA 11520 

40 CTGCAACAAA TCCTAAAGAT GCGTCTGATA AAGCAGATGT GATAGAACCA GAGTCTGAOG 11580 

TGGTTAAAGA TGCTGATAAT AATATTGATA AAGATGTGCA ACATGATGTT GATCATTTAT 1164 0 

CCGATATGTC GGATAATAAT CACTTCGATA AATATGATTT AAAAGAAATG GATACTCAAA 11700 

TTGCCAAAGA TACTGATAGA AATGTGGATA AAGATGCCGA TAATAGCGTT GGTATGTCAT 11760 

CTAATGTCGA TACTGATAAA GACTCTAATA AAAATAAAGA CAAAGTCATA CAGCTGAATC 11820 

ATATTGCCGA TAAAAATAAT CATACTGGAA AAGCAGCAAA GCTTGACGTA GTGAAACAAA 11880 

ATTATAATAA TACAGACAAA GTTACTGACA AAAAAACAAC TGAACATCTG CCGAGTGATA 11940 

TTCATAAAAC TGTAGATAAA ACAGTGAAAA CAAAAG7VAAA AGCCGGCACA CCATCGAAAG 12000 
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GTGTAGCGAA 


AGATAATTTA 


AAAGAAGAAA 


ATGTAGAAGC 


AGTAAAAGCA 


GGAATTGTTA 


8520 




ATTTAGAGCG 


TCATGTTAAT 


AATATTAAAA 


AATTCGGTGT 


AGAACCGGTT 


GTTGCAATTA 


8580 


5 


ATGCATTTAT 


ACATGATACC 


GATGCAGAAG 


TAGAATATGT 


AAAATCTTGG 


GCTAAAGAAA 


8640 




ATAACGTACG 


AATTGCCTTA 


ACTGAAGTTT 


GGGAAAAAGG 


TGGTAAAGGT 


GGCGTTGACT 


8700 


10 


TAGCAAATGA 


AGTATTAGAA 


GTCATTGATC 


AACCTAATTC 


ATTTAAACCT 


TTATATGAAT 


8760 


TAGAATTACC 


ATTAGAGCAA 


AAGATTGAAA 


AGATTGTGAC 


TGAAATCTAT 


GGCGGTTCAA 


8820 




AAGTAACGTT 


TAGCAGTAAA 


GCGCAAAAAC 


AATTAAAACA 


ATTTAAAGAA 


AATGGTTGGG 


8880 


IS 


ATAATTACCC 


AGTATGTATG 


GCGAAAACAC 


AATATTCATT 


CTCAGATGAT 


CAAACGTTGT 


8940 




TAGGTGCACC 


ATCAGGATTT 


GAAATTACAA 


TTCGTGAATT 


AGAAGCGAAA 


ACAGGTGCAG 


90X)0 




GATTTATCGT 


AGCGTTGACA 


GGTGCAATCA 


TGACTATGCC 


TGGTTTACCT 


AAAAAACCAG 


9060 


20 


CAGCATTAAA 


CATGGATGTT 


ACTGATGATG 


GTCATGCAAT 


TGGGTTATTC 


TAATAAATCA 


9120 




TGTCAATTGT 


TTAATAAAGA 


TAAGTAAATA 


GTTTAATAGA 


CCGGACTGTT 


GGAGATGCAT 


9180 




TATTTCAGCA 


GTTCGGTTTT 


TTGCTGTGCT 


AAAAATAGAT 


TCAATTTGGC 


GAATCTAACG 


9240 


2S 


ACAATGTTTG 


AAGGTGGTTA 


ATTAATGTAT 


ATGAAGATAA 


AAAGTGGGCT 


TGAAGAATAG 


9300 




GAAAGCGATG 


CAATGAATAT 


TCCATATTAA 


AAAAAATTAA 


TAAAATAGGT 


TGCAATATTT 


9360 




AATTGGGATG 


CGCTACAATT 


AACACTAATA 


ATTGATATTG 


ATAATTATTA 


TCAATTAAAT 


9420 


30 


ATAATCTTAT 


AGGAGTTGTT 


AACAACATGA 


ACAAACATCA 


CCCAAAATTA 


AGGTCTTTCT 


9480 




ATTCTATTAG 


AAAATCAACT 


CTAGGCGTTG 


CATOGGTCAT 


TGTCAGTACA 


CTATTTTTAA 


954 0 


35 


TTACTTCTCA 


ACATCAAGCA 


CAAGCAGCAG 


AAAATACAAA 


TACTTCAGAT 


AAAATCTCGG 


9600 


AAAATCAAAA 


TAATAATGCA ACTACAACTC 


AGCCACCTAA 


GGATACAAAT 


CAAACACAAC 


9660 




CTGctACGCA 


ACCAGCAAAC 


ACTGCGAAAA 


ACTATCCTGC 


AGCGGATGAA 


TCACTTAAAG 


9720 


40 


ATGCAATTAA 


AGATCCTGCA 


TTAGAAAATA 


AAGAACATGA 


TATAGGTCCA 


AGAGAACAAG 


9780 




TCAATTTCCA 


GTTATTAGAT 


AAAAACAATG 


ATUVCGCAGTA 


CTATCACTTT 


TTCAGCATCA 


9840 




AAGATCCAGC 


AGATGTGTAT 


TACACTAAAA 


AGAAAGCAGA 


AGTTGAATTA 


GACATCAATA 


9900 


45 


CTGCTTCAAC 


ATGGAAGAAG 


TTTGAAGTCT 


ATGAAAACAA 


TCAAAAATTG 


CCAGTGAGAC 


9960 




TTGTATCATA 


TAGTCCTGTA 


CCAGAAGACC 


ATGCCTATAT 


TCGATTCCCA 


GTTTCAGATG 


10020 




GCACACAAGA 


ATTGAAAATT 


GTTTCTTCGA 


CTCAAATTGA 


TGATGGAGAA 


GAAACAAATT 


10080 


50 


ATGATTATAC 


TAAATTAGTA 


TTTGCTAAAC 


CTATTTATAA 


CGATCCTTCA 


CTTGTAAAAT 


10140 




CAGATACAAA 


TGATGCAGTA 


GTAACGAATC 


ATCAATCAAG 


TTCAGTCGCA 


AGTAATCAAA 


10200 
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SS 



TGATGTAATT 
TGAACACGAA 
AATAATTAAG 
AGAAATTCGT 
ATTTAAAGAT 
TTGGGAATTA 
TTGAAGATTT 
TCGTTTACTT 
AAAGCATTTG 
TAATATGATC 
TCAAGCGCTA 
AGCGCTAACC 
AAAATTTCTT 
AAAAGTTGAC 
AGGATATTGC 
AAGCTAAAAT 
TAACTGCGAT 
CTGATGCATT 
GACCAACATT 

tggaagatat 
catxStctgc 
gacgtattga 
ttgggttagg 
cgtctgaaat 

TTAGTCGTAT 
AAGTGGAAGG 
CAATTGAAGG 
GTAACTCAAT 
CTGGATTTGG 



ATGACAGCTG 
GCAGTTGCCG 
GCGTTTGTTG 
ATATTTGTTA 
AAATTACCTA 
AATTTAGATG 
TTTTCGAAGA 
TACGATAAGC 
AAAATGATGA 
TTTGGAAGAT 
AAAGTTGTAT 
CGATTTAGTC 
AGCTACAACT 
TCATTTATCA 
TGCATCAGTA 
CGACATTAAT 
GAGCCCAACA 
CCATGAGTTA 
TGGTATCAAA 
CAACTTACAT 
GTTTATCGAT 
GTGGAAACX5T 
TGGACCTACA 
TATGGCGATT 
TACTATTGGT 
TGCACTTGCA 
GACACCTGCA 
TTTAGCAACT 
TTCAGACTTA 



GTGAACGAGT 
AAGCAGGAAT 
CACTGAGAAA 
AAGAAGGTTT 
AAACACGGTC 
CTGGGGATTT 
ATAAAGGGTG 
AATATAAAGA 
CATAACAATA 
GATTATTTTA 
AATACAAAAC 
GACAAGTTTT 
TACATATTAT 
GATTTAGATA 
GGTATTTCAG 
AAAATTACX5C 
CCAGCTGGTG 
AATAAAAACG 
GGTGGTGCGA 
TTCAACGGAG 
AATCATATTC 
GTATTAGATA 
AATGGTGTAC 
TTATGTTTAA 
TACACTAGAG 
ATGATTTTAA 
TTAGTTCATG 
GAAACAGCAC 
GGCGCTGAAA 



TGGACCATTT 
TATTGGTAAA 
AGGATATGAA 
GTCGGCACAT 
AGGTAAAATT 
AAGTACAATG 
ACAACATATT 
ACTGTTAACT 
ATGGCATATC 
AATAATAGAA 
TTTAATAAGT 
TAACAGTTCG 
AAATGCATAA 
TTGCGAATC7\. 
AGGATGCATT 
CAAGAGAAAA 
AAGGTAAATC 
TTATGGTTGC 
CTGGTGGTGG 
ATTTCCATGC 
ACCAAGGTAA 
TGAATGATCG 
CACGTGAAGA 
GTAGAAGTAT 
ATCGCAAGCC 
AAGATGCAAT 
GTGGACCATT 
GTGATTTAGC 
AATTCATGGA 



GAGGTTGAGT 
CCTGATCCGG 
CCAACAGACG 
GCAGCACCAC 
ATGAGACGTG 
GAATAATGAC 
TCATGTCAAT 
TGTGTCATAT 
TTTATATTGC 
AAATATAGTT 
GAATTTATTG 
TTATTATATG 
ATTAAACAAA 
ATCAACACTA 
AGAACCTTAT 
CAAAGGGAAA 
AACGGTTACA 
ATTAAGAGAG 
TTATGCGCAA 
GATTACAACT 
CGAATTAGGA 
TGCACTTAGA 
TGGCTTTAAT 
TAAAGACTTA 
AGTTACAGTT 
AAAACCAAAC 
TGCXSAATATC 
TGATATCGTT 
CATTAAAGCG 



CTAAATTGGT 
TTCGCGGTGA 
AATTAAAAGA 
GTGAAATCGA 
TATTAAAAGC 
ATGAATGTTA 
GTTTAAATAA 
CATTTCGTAG 
TTTTTATTTT 
ATCAATAGTA 
CAAAAATGAA 
AATGTAAGTA 
AAGGGGCGAA 
CAACCAATTA 
GGTCATTACA 
GTTGTTTTAG 
GTTGGTTTAG 
CCTGCTTTAG 
GTCTTACCTA 
GCAAATAATG 
ATCGATCAAA 
CATGTAAACG 
ATTACAGTAG 
AAAGATAAAA 
GCAGATTTAA 
TTAGTACAAT 
GCACACGGTT 
GTAACGGAAG 
CGTGAAGCAG 



6720 

6780 

6340 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

73 2 0 

7380 

7440 

7500 

7560 . 

7620 

7680 ' 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 
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TACCTGTAGA 
GTTGATCTTG 
CTTTATACTG 
TCAAATCTtC 
CTTCGGATTG 
TGTATTAAAT 
AAAAATTTTG 
AACAAAGGGG 
CCTTAAAGAT 
TTCTTGGAGT 
TCAAGGATTA 
TACTTATAAA 
AGAAGTTGAC 
TGCGTTGTTA 
GGAAAAGGCA 
GGCATTGTTA 
AGATGAGGAT 
TGAATTTGAC 
TTCTACTGGG 
TTCTGGAAAA 
AGGTTGGGTT 
TTGTATAGCT 
AGTGACGATT 
TATTGTTGAG 
AAATCCTGAA 
GTGGATGACA 
TGGCTCAATG 
TGAATTACCA 
GTATCGTATC 



CCAGCGCTCA 
GCGAGCAACA 
ATCTTCAGCA 
GTGTGTTAAA 
ATACGTCTTT 
TCATTATATA 
AATAATTAGA 
GAGTTTCAAA 
TATGAAGAAA 
GAAACTGGAA 
GGGGATAAAA 
GATATGCAAC 
AAAGGTGACA 
GGTGTTTTAA 
GTTGCGGATA 
CCTCGAGTAC 
GTAGAAGACA 
ATTGAATGGT 
CAACCTAAAG 
TATGTATTAG 
aCAGGAACAT 
GGTGGTCGCT 
TGGTATACGG 
AAATATGACT 
GTTATAAAAT 
GAAACAGGTG 
GGCAAACCAT 
CCAAATCGAA 
TGGAAGAATC 



ATTTCATCAG 
TAAATTCTAC 
TCTCTAAATG 
GGACCTTCAA 
AAATGATTCA 
GAAslATTTAC 
AATGTTATGT 
TGAAAGTCGA 
CATATAATAC 
AAATGAACAT 
TAGCGTTAAA 
GGTTATCTAA 
GAGTATTTAT 
AAATTGGTGC 
GATTAGAGAA 
CTGTAGATAA 
ATTACATAGA 
TAAAGTCGGA 
GtGTATTGCA 
ATTTACAAGA 
CTTATGGTAT 
TTTCGCCAGA 
CACCAACAGC 
TGTCATCGTT 
GGGCGAAAAA 
GACATATGAT 
TACCTGGTAT 
TGGGCAACCT 
CAGAAAAATA 



GATAGTGGAA 
CTTCTGGTAA 
CGGTTAAATG 
TAACAAATTG 
TATTTTCAAC 
AATAATTAAT 
ATAATATTGA 
AGTTTATAAA 
TTTTGATTGG 
GGCATATGAA 
TTACAAAGAT 
TAAAGCAGCG 
ATTTATGTCG 
AATTGTTGGG 
CAGTGAAGCT 
ATTACCAAAC 
CTTCATTAGT 
TGATGGTTTG 
TGTTCAACAA 
AGATGATGTT 
TTTTGCACCA 
ACAGTGGTAT 
TTTAAGAATG 
ACGTTCGATT 
AGTATACGGT 
TGTTAACTAT 
TCAAGCTGCA 
TGCTATAAAA 
TAAATCATAT 



AGTGACATAA 
TGTTGTAATT 
CGCATCGAAA 
CTTGTCATGA 
TCCTCAATCG 
ACTAGAAAAA 
GAAAGAAAGC 
GGAGCGCAAG 
AAAGACGTAG 
TGCATAGATC 
GAGCACAGAA 
AATGTTTTGT 
CGTACACCTG 
CCGTTATTTG 
AAAGTGTTAA 
TTGAAAAAAA 
TTGATGGAAA 
ATTTTACATT 
GCAATGTTAG 
TATTGGTGTA 
TGGTTAAATG 
AGTATGATTG 
TTAATGAGTG 
CTATCAGTAG 
TTAACXXSTGT 
CCAACGATGG 
ATTATCGATG 
AAAGGCTGGC 
TTTATTGGAG 



CCCACAATGA 
TCTAACAAAG 
GTAAGCGCTT 
ATGTAATT^AT 
ACGTCGTGAT 
GGATAAAAGT 
GTTTTCACAT 
GTAAACATAA 
AACAAGCATT 
GCCATGTAGA 
AAGAATCGTA 
CTGAACATGC 
AACTATATTT 
AAGCATTTAT 
TTACTAATAA 
TTGTTGTCGT 
CTGCTAGCGA 
ATACATCAGG 
TGCACTATAT 
CAGCAGATCC 
GCGCTACAAA 
AAGATTTTAA 
CTGGTGACGA 
GTGAGCCTTT 
TAGATACTTG 
ACGTCAAGCT 
ATGCAGGGAA 
CATCAATGAT 
ACTGGTATGT 



4920 

4960 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 
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10 



2$ 



30 



3S 



45 



50 



3120 
3180 
3240 
3300 



3780 
3840 
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CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATGCGCTTA 
TTPJ^CAAAAT ATATGAACGA TGAAAAGATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 
ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 
AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG 

TTACTTATTG ACGAGTAGAT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 3 360 

TTTCACTAAT TTCTTTTGTG CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 34 2 0 

TTTTAGGTAA TAATTCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGTAAAT 3490 

^5 AACCACTTTG AATTGGTTGA TCAATTAAAC TTTVAGAATAC ATGACTCCAT GCACGTGGTA 3540 

CGACTCTCCA AATATTGTAG CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3600 

AATCAGCTAA ATATTTTACA AAATATGGAA TTTCATATAA TGAATGTAAC GTACAATTTA 3660 

20 GATGAGTTAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 372 0 
GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA ACACTCCAAA AATGATGCAT 
CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCGATA TCTTCACCGC 

GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTCC GGTTTCATGG ATAGAATAAG 3 900 

TAGTAACATG GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCCATGA TGTGCATCGG 3 960 

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4 020 

TATCATTGTA TATACAAAAA CCACTTGCTC GACCAGGTTG AGCGTGATGC AAACCACCAC 4 080^ 

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 414 0 

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 4 200 

TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTGAAGCA TGCTTAATAG 4260 

CTTCGACGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 4320 

40 GTACTATTTG TTCTGGAGAC AATAAATTTG CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4380 

AACGCATTTG ATTGAAGGGA TGTTGGTCAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440 

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTCCGATAT TCCAAAAATT 4500 

AAAAGAAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4 560 

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAGCTGGATG ACTTGTTATT TCTGGATCAT 4 620 

CTGTAGCGAA TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TrTTTATAGT 4 680 

CAAATACATC TAACTTTGAA TTTTTTAAAT CCCAATGCCA GTAATATTCT GTAGTTATAA 4740 

CGATATAATT CTCX5AATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4 800 
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JKAPiAGCCTPiT 


TCTATTAAAA 


CGTGGTTTAT 


CTGCTACAAT 


CGAAGAGTTT 


GTTTATGCAG 


1320 




CTGAATACAT 


TGCTTCACAA 


GGTAATCAAA 


ACATTATTTT 


ATGTGAACGT 


GGAATCCGAA 


1380 


5 


CTTATGAAAA 


GGCGACACGT 


AACACTTTAG 


ATATTTCAGC 


AGTACCAATT 


TTAAAACAAG 


1440 




GTACACACTT 


ACCAGTCATG 


GTAGATGTTA 


CGCATAGTAC 


AGGTCGTAAA 


GATATCATGT 


1500 


10 


TACCAACTGC GAArsAgCAT 


TAGCAGTTGG 


TGCTGATGGA 


GTTATGGCTG 


AGGTGCATCC 


1560 


AGATCCATCT 


GTTGCACTTA 


GTGATGCGGG 


TGAAGAAATG 


GATTTAGATG 


AATTCCAAGC 


1620 




ATTTTATGAT 


GAATTAAAGC 


CTTTAGCTGA 


TTTATATAAC 


GCTAAAAAGT 


TAAAATAATA 


1680 


15 


TTCCAAGGAA 


ACTATAGACT 


ACTTAACTAA 


TATGTCATGT 


TGAAGTAGAA 


TATTATCTTT 


1740 




GAATCGACAA 


TTTTAAACTT 


ACAGCCATTC 


TAAGAGTATA 


TTACriTTAG 


AGTGGCTATT 


1800 




ATTTTTTGTA 


TAGAAATAAA 


GGTATACTGC 


ACTTAACGAT 


TGTTATAATA 


CTTCGACACT 


1860 


20 


TGTTCAATTT 


CACAATTATT 


AAAGATTATG 


ACTGATAGCA 


GTAATTAAAA 


TTATAACTAT 


1920 




GAATTATCTG 


TAAAATATAA 


TAGATTCACA 


CATTTGTTGC 


TGAAATGTGA 


ACATTTTTCA 


1980 




ACAAATGCAA 


TTGATATTTG 


AAAAGGCTTT 


CTCAAAACAT 


TACAATTAAA 


AATGAAAAAA 


2040 


25 


GTTTATATAA 


AATTAAAATA 


TATCGTTCGT 


TATCATTTAG 


CGTTTGTTTT 


TATTTCAAGC 


2100 




TTTTCGCTAA 


ATTTTTCCAA 


ACAAAAATAT 


GTTACTGTAA 


ATTAAAATAT 


GGTAAACTAT 


2160 




GAAAATGAAA 


TGAAAACATG 


TTATTATAAT 


GAATAAAACG 


TTTACAAGGA 


GGAAATTATG 


2220 


30 


ACAGTTACTA 


TATATGATGT 


AGCAAGAGAA 


GCGCGTGTCT 


CTATGGCCAC 


AGTGTCGCGT 


22180 




GTTGTTAATG 


GGAACCAAAA 


TGTTAAAGCA 


GAAACTAAAA 


ATAAAGTTAA 


CGAAGTCATT 


2340 


35 


AAGCGTTTGA 


ATTATCGTCC 


AAATGCTGTT 


GCTAGAGGTT 


TAGCTAGTAA 


AAAGACAACA 


2400 


ACAGTAGGTG 


TGATCATTCC 


AGATATATCT 


AATATCTATT 


ATTCACAACT 


TGCTCGTGGA 


2460 




CTTGAAGATA 


TTGCAACAAT 


GTATAAATAT 


CACTCAATTA 


TTTCAAATTC 


AGATAACGAT 


2520 


40 


CCTGAAAAGG 


AAAAAGAAAT 


TTTTAATAAC 


TTATTAAGTA 


AACAGGTTGA 


TGGTATTATT 


2580 




TTCCTTGGTG 


GTACAATTAC 


TGAAGAAATG 


AAAGAATTGA 


TAAATCAATC 


ATCTGTACCT 


2640 




GTAGTAGTAT 


CAGGAACAAA 


TGGTAAGGAT 


GCACATATAG 


CATCAGTTAA 


TATTGATTTT 


2700 


45 


ACTGAAGCTG 


CGAAAGAAAT 


TACGGGAGAA 


TTAATTGAAA 


AAGGCGCTAA 


ATCATTTGCT 


2760 




TTAGTAGGTG 


GAGAACATTC 


TAAAAAAGCT 


CAAGAAGATG 


TTTTAGAAGG 


TTTAACTGAA 


2820 




GTGTTAAATA 


AAAATGGCCT 


TCAATTAGGT 


GATACATTGA 


ATTGTTCTGG 


TGCTGAAAGT 


2880 


50 


TATAAAGAAG 


GCGTAAAAGC 


TTTTGCCAAA 


ATGAAAGGCA 


ATTTGCCAGA 


TGCCATTTTA 


2940 




TGTATCAGCG 


ACXSAAGAAGC 


AATTGGTATT 


ATGCATAGTG 


CAATGGATGC 


TGGTATTAAA 


3000 
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AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 
CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 
ACCAGATA 

(2) INFORMATION FOR SEQ ID NO : 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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600 

660 
668 



60 
120 
180 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
CATCAGTTTA TTTTGAAAGG CAATGCGATC ATTTCATGTA TTTATGTTGT TTGAAACATC 
GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG ATGGAAAAAT ATTCGATAAT 
CAAATAAACA ATGAATTTTA GAAGGTACAA TGACGTTCAC TAATTTA/VAT ATAGCTGAAT 

GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 240 

TTTTTATTAA GCGTTAGGTT GAATGTATGA GAATTTTAGA TTTATAATAG AAGATAGAAA 3 00 

CGAAAATTTT TCTTAAAAGC AGTAATGTTG ACTCAAAATA AGCTATAATA ATGACACTTA 360 

TTTAATTGAT TAACATTTGC TAATAAATAT CAATATAGAA TATAACTTTC CAATAATGAC 420 

TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 4 80 

ATTTAGGGTT GAACTATACA TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 54 0 

35 GCTTTAAAGT GGAATATACT TGAAATAATT AGTAGAGGTG AGTAAGGATG AGTAATAAAT 600' 

TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTTAGAC TTATTATCTA 660 

AACQl'GGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 720 

ATGATCCACA ACGTGAAAAA GAAATGCTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 780 

TCAACGATAA TACTATTAAG CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 840 

AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900 

TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960 

CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 1020 

GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAG 1080 

GCCTAGGTGT TGAAGGACTT AAAATACTTA AACAGATTAA AGATAAATAT GATTTAAATG 1140 

TTGTCAGCGA AATCGTAAAT CCAAATGATT TTGAAGTGGC TGATGAGTAT TTAGACGTAT 1200 
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TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT CCAAAAATTA TTATTATTAC 1740 

ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT GAACX3CTAAA CAAATTCGAC 1800 

CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT TGCACAACTA GATAATAGTG 1860 

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC AGAGGAATTT CAAAATTTAT 1920 

TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGCAAA TGAAGCGCGT ATTGCGTATG 1980 

AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA TCaAAATTTA ACaTTAGACT 204 0 

CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT GATGATAGCG ATATTTATTG 2100 

CGTTAATGTT TGATTTTGAA AAAGCATACT GGATACCGTT ATCTGCACAT ACAATATTAC 2160 

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC ACGAGGTTTA GGTACTATTT 2220 

TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT ACCAACACCT GTTGCAGTAA 2280 

TTTTAATGGG CATTGCAGCA TTGTTTACTG AAGCATTGGT GGGAGCAAAT TATGCGATTG 2340 

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG ATTAGCATCA CAGAATTTAA 24 00 

CAATTAACAT TGCGTTTCCA AGAGTTATTG ACGTTGCAAT GGGTATTGTG ATTGCAATCA 2460 

25 TAGGTTTATT TGTCCTTGGA CAACGTACCG CATCCGCATT GCTTCCTAAT GTAATGGCTG 2520 

AAGTTGTTCG TAAAGAAGCA ACGCTCTTTC ATTATTTATT TTCTGAAAAT CAATAT 2576 
(2) INFORMATION FOR SEQ ID NO: 309: 
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15 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



C (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 09: 

CAAGCACATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60 

TTTCATACCC AAAGTAATCC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120 

TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGATGTCT AAATCATATT GAATTTCAAG 180 

45 TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 240 

GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA TCAATTGTTC 300 

ATATTCTAAT TTTTCTTTAT ATGATAAACC ATTTTTCTTA CGTACAGTTG TAGAAGATTT 360 

AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATGAT TTTTTATAAC TTTCATAATC 420 

TTCAAAAGTT CCGATAATCT TTTCCATCTG ACCATCATGA ATAAACCAAT ATGACTGTGC 480 



55 



iSDOaO: <EP 0786519AZ_I_> 



1151 



EP 0 786 519 A2 



2S 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 308- 

S 

GATATCGTAS CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 
TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 
TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 
TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 
GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAAITXSCAAT TGCAGGATTT 
15 ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GPAATTTATA TGTAAGTATT 

TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 
GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 
AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 
ACAATGACAT TCCAATTGTT AATCGTTCCA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 
GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 
CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 
TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 
ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TrTTAACTAG 
CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 
GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 
35 GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 

AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 1080 
TCAAtCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGGACTGTTA ATGATTATTC 1140 
CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 1200 
CACTAGCCCA TAmATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA 1260 
TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 1320 
TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 13 80 

TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 1440 
TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTCC GATTTTAGTA GGCGGTATAC 1500 
TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 1560 
CAATTCAAAA TGATTTTAAA CTCATATCTA AGTTGTTACA CACTTATAAT GATAAATCTG 



40 



45 



SO 



1020 



1620 
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TGCCCAATCA 


ATtTCTTTCT 


TAtCCTCAGC 


AgTTAATCTA 


CCAAATCCTT 


TTTGATGTGG 


4680 




AAAACGACCA 


TATGAAACTA 


ATTCCCCAAC 


AGTTAAGCCA 


TCTGCTACTT 


CaGGTGaTTG 


4740 


5 


aGGTAAAaTG 


GCtATTTTTt 


TGcAATCyCy 


TTCGTAGATt 


GTGtATGAAT 


ATTTTCACCA 


4800 




TCTAAAAATA 


CTTCGCCTTC 


TTTAACTGCC 


AATAAACGTG 


ACAATGCCTT 


TAGCAAAGTA 


4860 


10 


GATTTCCCGC 


AGCCGTTAGG 


ACCAATGATT 


GACGTCACTT 


TGCCATCTGG 


TATTTGAACA 


4920 


TCTAATTTAT 


TTATAATCGT 


GTTATCCCCG 


TAACCAATTT 


TAACTTGTTG 


TCCATGCAAA 


4980 




CGATTCATAA 


TTTCCCTACT 


TTCAATAAAA 


TTCTTTCTGT 


TTATAAAAAA 


TAATTTCTAT 


5040 


IS 


TTTTAAATTA 


TCAATTTTCA 


AAGACATCCC 


AATTGATAAT 


GATTATCATG 


AACATCATTA 


5100 




TAACATi-rrr 


CAATCTTATT 


GACTAACATT 


AcrrriTAAA 


TTGGATAGCT 


CGATTTGTCA 


5160 




TGTCTTGTAT 


ATTACTTTTA 


TAAAATAAAA 


AACGCCCACA 


GATAAGTCTT 


CATAGTTCAA 


5220 


20 


AAACTTGTCC 


GTGGACTTCT 


ATTTAAGTAT 


GTGTGCTCAT 


ACCATTTATT 


TATTCATCTG 


5280 




CAAGAAAGCC 


ATTACCATAG 


ACATCTCTTA 


CATCATGAAT 


TACGAGGAAT 


GCATCTTTAT 


5340 




CGATTTGTTT 


AATTAATCGC 


TTTGCTTTTG 


AAACTTGTGT 


TTTAGAAATA 


ACAACGTATA 


5400 


25 


AGACATCTTT 


TTCTTCACGC 


GTATAATAGC 


CATGTCCGTT 


TAAAATGGTT 


AAACCTCTTC 


5460 




CAATTTGCTC 


GTCTATTGCT 


TTGGCAAGTT 


TGTCGGGATT 


AGTTGAAATA 


ATCGTCATAG 


5520 




CrriTTTAGT 


GTTTAAACCT 


TCTATGACAT 


ATTCCATCAC 


TTTTGTTCCT 


ATATAAAGTG 


5580 


30 


ATATTACTGT 


TACTAATACT 


TTATCAAGTG 


GAATAACTGT 


AAGTGAAATT 


GCAACAACGA 


5640 




TCATATCGAA 


GAAAAGCAAA 


GCATATGGCG 


TGCTTACATC 


GAGGTATTTC 


GTTGCAATTC 


5700 


35 


TCGCCAAAAT 


TGTTGTACCT 


GCTGTTGTAC 


CGCCTGCAAG 


GATAATTACT 


CCGATTCCTA 


5760 


GTCCAACGCT 


TACACCACCA AAAATGGCAT TCACAATGCT 


GTTTCCAGTT 


TCTACTTGCC 


5820 




ATGATTCTGT 


TAAACTCAAA 


AATATTGAAA 


TAAGAATTGT 


TACAAGAATA 


GTTAAGTACA 


5880 


40 


TACTTCTCTT 


ACTCAAAAAt 


TTATAACCTA TGGCAATCAA TACTGCGTTG 


ACCAAGAAGT 


5940 




TAGTGATGGC 


TGGTGAAATA 


TGAAACGCAT 


AATATAAAAT 


AATTGCTAAA 


CCTGTAACCC 


6000 




CGCCTTCACC 


TAAGTTACCA GAAAT^ATAA ATGCATTTAC 


ACCTGCAGCA 


AAGATAAATG 


6060 


45 


AACCTAAGAC 


AACTAGTATT 


AAATCTTTAA 


CCGTTTTATT 


CACGAAACCA 


TCCCCTTTAT 


6120 




ATATTTATTA 


GACTAT 










6136 



(2) INFORMATION FOR SEQ ID NO: 308: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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CCAATTGCAA TAAATAATGC GATTGCAAAT CCGCCACtGC GTTAATACCT AATATACCTG 28 80 

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTAT-TGC ACCACTAATA CTAAgCtGCG 294 0 

CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 3000 

ATGGCATCAC CTTGTCCAAT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 3 060 

CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 3120 

ATCAATTGTC TGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 3180 

AATATAAAAA GTAAGGAACA CCGATAAATG AAATGATTGC ACCAACAGGC GCTTCTCCTA 324 0 

,5 AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 33 00 

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3360 

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 34 20 
TTAAACATAT GCCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGCT AATGATTCAC 
CAAGATTTAA AATGGTAAGT TGTTTACTAA TTGTTAATAT AATGAATAAC GCAATACCAA 
TTAATGGAAT TGCCCACTTA AGGTGTGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 

AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3 660 

ACATAGCACT TACTGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3720 

TAGATCGGCC TATCATTAAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATCCAGCAA 3780 

ACATCAATAT TAAAAATGAA GTGTTTGGTA AAACTGCATA TGTTAATGCT AAAGCAAATG 3 840 

AAGCACCTGA ATTTAAACCT ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACACCTT 3900 

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTTCTGG 3 960 

GAATCCTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA TTAAAAATAG 4 020 

CCTCTATAAT TGTAGAGGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAAATAGTG 4080 

40 TGATTAGTAG TATCATACTT APACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 414 0 

TCTCTCTATT TGTCATGTTA ATTGTCCTTT TTATCATATT AACTTACTTA ATTAAGAATA 4200 

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4 260 

TAAAACAACA TCAATATTAA AAACTTTTTC TAATATTTCC TGTGTTAATA CGTCTTCTGT 4320 

TGAACCTGTA GCGATGATAT CCCCTTCTTT CATCGCAATA AGATGATCTG AGAAACGAAT 43 80 

CGCTTGGTTG ATATCATGAA GAACCATGAC AATTGTACAA CCTTGTTCCT GATTTAGCTT 444 0 

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATATCT AAATATGTTG TTGGTTCGTC 4500 

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA ATCCAAACAC GTTGTCTTTG 4560 



55 



INSOOaO: <EP 07865 19A2_L> 



1148 



EP0 786 519 A2 



10 



IS 



20 



25 



30 



3S 



40 



45 



SO 



ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 
TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 
ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 
GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 
TTCACCATGT ATACCAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 
TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 
CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 
GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 
AGCAATGTCG TCAOGAACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 
TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 
ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 
TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 
TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 
TACAATAACT GTATTAGCAA TCAGATCTAA CTCTGGGTGC GCAATTAACA ATCCTTCAAG 
CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 
TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 
CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 
TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 
AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 
TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 
CCAQCTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 
ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 
TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA cTGCTACAGC AGCAGATGAT 
AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 
CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 
GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 
ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 
ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA GACCTATTAA TACCATACTC 
GCAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 



1080 

1140 

1200 
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1380 

1440 

1500 
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1980 
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2100 
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ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAPiPiPiAATC CCCAAACTAA 
ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATGCAATCAA 

5 

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 5880 
CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5940 
GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 6000 
ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 
ATCTTTTAAT AGTTTT 
75 (2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
ACCGCCGATT GATAGCTTTA CTGCTGCGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 
TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 

AGATCCAATA TCGTAAAAAC ACAATGCATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180 

GATATCAAAT GAAGTTCCAA TTGAACCATC TGGTAATCCC CCGATTGGTA TAATATCAAC 24 0 

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 300 

3^ AACAAGTATA ATTTTAGGGA TTATCTTCAT CTCCAATCAA AGCGTTTAAA ATATAAACCA 360 

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAGCTCTAC 420 

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA TTGAATGTTA 4 80 

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTOGCGCTAC TACATCATAC ATTGTCTTTT 540 

CATTTAAAGT AACTTTACCA CGTGATTCAA CCGCTTCGGC AAATGCCTGA ATTAGTGTAA 6 00 

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGCAQA CATTTTAACA AAGCTAAAGC 660 

CATACAGTGG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720 

ACAATGATTG CATTGAGCTA TCATCAAGTT TGTCTTTAAG ACTACTAAAC CCACGAACCA 780 

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 840 

GTTTTTTAAA CGTTTCTTCT AAATTTAATA AACGTGCTTT CATATCATTC ACTTTCATTT 900 
GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 



960 



55 



3NSOOCIO: cEP 078651 9A2J_> 



1146 



EP0 786 519 A2 





GCAGAGATTG 


CGCCTAAAAC 


AATTCCX5TTT 


TGAGTCAACC ATGCAAATTG 


TTCACCTAAA 


3960 




CCTTTAAATG 


CTTGTGGTAC 


AGCGCTTATA 


CCAGTACCTA AtCCTACTGA 


TACAGCGATA 


4020 


5 


ATTAATAAAT 


TGTTTTGATT 


TTTAAAATCG 


ATATGTCCTA ATATACTAAC 


ACCATATGCC 


4080 




ATTACCATGC 


CAAACATAGC 


TATCATCGCA 


CCGCCTAACA CAGGTAGCGG 


TATGATATTT 


4140 


10 


GCTAATGCGC 


CAAGCTTAGG 


TATACAACCA 


CATATAAGTA ATAACACGAC 


CATGCCGTAT 


4200 


ATAACATTGT 


TTTTCTTAGC 


GCCGGATAAA 


GAAACAAGTC CTACATTTTG 


CGAATAGGCT 


4260 




GTATACGGAA 


ATGAATTGAA 


TATAGAACCT 


AAcACTATCG CTAGACCTTC 


CGCAGTATAA 


4320 


15 


CCTTTACGAA 


AATCTTTTCT 


TTCTAACTTC 


TTACCGGTAA TTTCACTTAA 


CGCATGATAG 


4380 




ACACCTGTCG 


ACTCAATTAA 


ACTAACXSATA 


GCTACAATAA AGAACACTAA 


CGTCGATGTC 


4440 




ACATCAAAGC 


TAAATCCAGA 


GAATCTAAAC 


GGCACTGGGA TGCCTAACCA 


ACCGGCATGA 


4500 


20 


TTGACTTGAT 


TAATATCGAC 


CATCCCAAGT 


AAGCCAGCAC CTATCGTTCC 


TAAAACGAGT 


4560 




CCAATTAATA 


TGGCAATACT 


CTTAATAAAT 


CCAGTTGTGA ATCTTTGTAA 


AAGAAGAATA 


4620 




ATGATTAATG 


TCATTAAACC 


TAACAAAATG 


TTCTTAACAT CTCCATAGTC 


CTTTGCACCT 


4680 


25 


TGACCTCCAG 


CTAAGTAATT 


CATTGCTACT 


GGCATTAAAT TGATACCAAT 


GATAGTAACA 


4740 




ACACTACCCG 


TTACTACTGG 


TGGGAAGAAT 


TTTACAAGAT GTGAAAAGAA 


AGGCGCGATG 


4800 


30 


ATAATAACTA ATATCCCTGA TAAAAATAGC 


GAACCATAAA GTACATCTAT 


TCCTTTCGTT 


4860 


TGACCAATTA 


AAATCATGGG 


CGCAACAGCC 


GTGAATGTAC ATCCAAGAAC 


GATTGGTAAT 


4920 




CCTGTTCCTG 


TTACTTTATT 


GG CTTGTAAA 


AATGTGGCAA CCCCACACAT 


AAATATATCT 


4980 


35 


ACTGTAACTA 


AGTAAGCGAT 


TTGTTCAGGT 


GTAAACTTCA AACTTGTACC 


AACAATGATT 


5040 




GGAACTAAGA 


TAGCACCTGC 


GTACATAGCT 


AAAAGATGTT GAACACTTAG 


GATTAAATTT 


5100 




TTCATTATTC 


TTCTCCCACC 


AATGTCACTT 


TGTTTCCTTC TAGTGAAGCA 


ACCTTGCAGA 


5160 


40 


GAGAAGAAAC 


TGTTAAACCT 


GCTTCTTCTA 


AACGTTGATG CCCATTTTGG 


AAACTCTTTT 


5220 




CAACAACAAT 


ACCAATACCA 


GCTGTCTTAG 


CATTCGCTTG CTGTGCGATA 


TCGTATAATC 


5280 




CTAATGAAGC 


ATCACCATTT 


GCTAAAAAGT 


CATCGATGAT AAGTACAGTA 


TCTTCTTCTG 


5340 


45 


ATAAAAACTC 


TTTTGAAACA 


ATGACCGTAC 


TTGTTTTATT TTTAGTAAAT 


GAATGAATAG 


5400 




ATGTTTCATA 


ATAACCATCC 


GTCAAAGTGC 


TAGGTTTTGC TrrrrrcGCA 


AATAAACATG 


5460 


50 


GCACATCAAA 


ATGCAGTGCA 


GCCATGATTG 


CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 


5520 


AGATTTTAGT 


AATCCCTTTA 


TCTTTAAATT 


GCTCGTAAAA AGTGCGACCA ACTTCATTCA 


5580 




TTAACTTTGC 


ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 


5640 



55 
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TAAATGAACC TTCTTCATAA ACTGAATTTG GACCACCTGA TAAGATAATA CCrPTTGGAT 
TCATTTTCTT AATTTCTTCA A-nTGAAATTT CATGATCGTG TAATTCACTA TAAACGCCCA 
TTTCACGAAT TCGGCGTGTA ATTAATTGGT TGTATTGGCT ACCAAAGTCT AAGACAAGGA 
TTAACTCTTG TTCTTTTGCC ATTTCCATAT TTGTCGTTCT CCTTTATCTT AATTAGAATG 
AGTAGTTCGG TGATTCTTTC GTAATTTGAA TATTATGTCG ATGGCTTTCT GCTAAACCAG 
CAGGACCCAT ACGTGTAAAT TGTGCTTCTT CGCGTAATTC TCTTAAATCG TGTGAACCAG 
TATAACCCAT ACCAGCACGC ACACCGCCCA TTAATTGGTA AATTGTATCT TGTAACGCAC 
,5 CTTTATAAGC CGTACGTCCT TCGATACCTT CAGGAACAAA nTCTrAGGC GCTrTOTCCT 

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT CCATCGCACC TAAAGAGCCC ATACCACGGT 
ATACTTTATA TTGTCTACCT TGGAAAATTT CTCTTGCGCC TCGGCTTTCT TCAGTACCTG 
20 CTAATAAGCT ACCTAACATA ACCGCATGTC CACCAGCAGC TAATCCTTTA AXGATATCTC 

CTGAGAATTT AATACCACCA TCAGCAATGA TAGCTTTACC ATGTTTGCGT GCTTCAGTtG 
CACAATCATA AATTGCTGTA ATTTGTGGTA CACCAACACC TGCTACAACA CGCGTCGTAC 
AAATTGAACC TGGGCCAATA CCAACTTTAA CAATATCTCC ACCCGCTTCA AATAAATCTT 
TTGTTGCTTC TGCAGTTGCT ACGTTACCTG CTACTAATCT GATTTCTGGG TAAGTCTTCT 
TAATATGTTT CACTTGATCG ATAACACCTT TAGAG-TCACC ATGTGCTGTA TCGATAACTA 
AGACATCCAC ACCTGCTTCG ACTAATTTTT GAGCACGAAT ATCAGTATCT TTTCAAATAC 
CAATTGCTGC GGCTACAAGT AGACGACCAT GTTCATCTTT TGCTCCATTA GGGAATTCGA 
TAACTTTTTC AATATCTTTA ATAGTAATAA GACCTTCTAG ACGTCCGTCT TTAACTAATG 
GTAACTTTTC AATCTTATGT TTTTGGAGAA TTTTTTCTGC TTCTTCAAGT GTTCTATTCA 
CTOTAGCTGT AATTAAATTt TCTTGCGTCA TTACATCTAC AATTTTAATC GAGAAGTCTT 
CAATAAAACG TAAGTCACX3G TTTGTTAAAA TACCTACTAA GTTCCGATCT TCTTTATrAT 
CAACAATTGG TACACCTGAA ATACGGTATT TACCCATTAA TGCTTCTGCT TCATAAACGC 
TTTCnTCTGG CGTTAAGAAA AATGGGTTTG AAATGACACC ATTTTCTGAG CGTTTTACTT 
TTTGAACTTC GTCCGCTTGT TCTTCy^CGC CCATATTTTT ATGAATAACA CCTAAACCAC 
CTTGACGAGC CATAGCAATC GCCATTTTAG ATTCAGTTAC AGTATCCATA CCAGCAGAAA 
TAACTGGAAT ATTTAATTTA ACTTTGTCTG ATAATTGTAC GCTTAAATCA ACGTCTTTCG 
GTAAAATATC AGATTGTGCT GGAATTAATA ACACATCATC AAACGTTAAT GATTCTITTG 
CAAATTTACT TTCCCACATT AAAAACAGCC TCCATTTTTC AAATTAATTA GTTATATTAT 
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GTAGATAATC 
TTAACCTTAG 
TACTATCCTG 
ATAAATAACA 
ACACTTGaAT 
CCTAAACAAT 
AAAACGCAAA 
CAAATTAAAA 
TTGTGCTTGG 
TTACGATACG 
TCATACCGTC 
CTCCCATAAC 
TTTCAAGACC 
CTAGTTTATC 
GTTGTCTCCA 
CTTTAAACAA 
CAACATTGTG 
TATATAGTGT 
ATACGTATAC 
TTAATTTATT 
CAC<?5AATTG 
AGATACATGT 
CTACGCCGCC 
CAATTTCGAT 
GGACAAAATT 
GAACACCATA 
CAGCX3ATAAC 
CAGTTTGTTC 
TACCGTATTC 



GAATTATTCC 
AGGTTAACCT 
TAGAATCTGC 
ACCCAGTCCA 
TAGAAAAAAT 
ATGAAATTTT 
CTTAGAACAC 
ACCACCCX3TG 
TGGTTTTGAT 
ACTAGAAATC 
GATAGATGTT 
ACCTACTGAT 
TTCTTCTCTA 
TTCAGTAATT 
TACTAAATGT 
TGTATTGATT 
GTGTGATTTG 
TCCTTGCGCA 
AAATTCATTA 
CATAAAGCGA 
CTCCATAACC 
TAGTTGATCA 
ACTCATCGCA 
TTCGATAAAG 
ATTTAATAAA 
AATGCGACGT 
TTCAAAGCCT 
TGCTGGTAAG 
ACGTTCATTG 



TAAAAATATA 
CTATGAAAGA 
TCTAATAATA 
AAAAATATTT 
AGTAGACGAA 
ATATAAAAAT 
CTTACCCCTT 
CAAACAGGTG 
GTAATGTCAT 
TTTTGTAAGA 
ACTGCACGAA 
TGAATGTTTG 
ATCACTTGGC 
TCTCCAAGTA 
TCTGGAATAC 
GGTTCGATTA 

attgtttgtg 
aggaagtcta 
ccaatgattt 
tctttcgcat 
atgtcxk:ctt 

CCTATTGCTT 
CATAATACAC 
TTTTCCATTG 
TCATTACCAT 
TTCTTATCTT 
TCTGGAATTT 
CCAGCGAATA 
GCACGTTCAA 



ACTCAAAACA 
AACACAATTT 
GAAACAGATA 
ATAATAACAA 
ACTATAATTT 
AAACACGGAA 
TTCCGTCCTT 
GTTTTATTAT 
AGACTACGCG 
CTTCCCAATC 
TACCTACTGT 
GTAACACTGT 
GTAAAATCGC 
CACGAATACC 
CTAACTCAAT 
ATTCGAATTC 
CTGTTTTAGT 
CACCTTTCAG 
TACGTTTTTG 
TAACACGAAT 
CACCTTTACG 
TATGCAATAG 
GACX3GTCTCC 
TCCATTGACC 
ATTCTGTATG 
CGATTGCTGC 
CAATAACTTT 
ACTCATCTGA 
CTTTACCACC 



CAATCTTCAA 
ATCACGGAGA 
CTCCTAGGCT 
CAAATGACTG 
ATGATTTAAA 
AATTGCCTTA 
ATTGTAATTT 
ATATAATTAT 
GTTGACGTGA 
GATGCGTGCG 
GTGATCATAC 
GAAGTATTGC 
GTCTGATTCT 
AAGACCAGGT 
ACCTAATTTA 
CATGTCTTCT 
ACCTGATTCG 
TTTTGATGCT 
TTCAGGATCT 
AATATTCATG 
AAGtAAGCCA 
TACAGCTACA 
TACGCGTTGA 
TCTACAATCA 
ACX5TACTTCT 
ATAGTCTGTG 
ATCAGAATGA 
CTTCGCATTA 
TAATJ\ATTTA 



ATTGAGTAAT 360 

CTTTGCTATT 420 

AATTAATCAT 4 80 

GAGCTTTAAT 540 

ACAAGAAGAT 600 

TTGATGAATT 660 

TCCAACTACT 720 

TCCCACTCAA 780 

TCTACTTCGT 840 

AAGTCACTTG 900 

GTACGGTAGT 960 

CAAATTTCTC 1020 

CTAACGATTT 1080 

CCTGGGAATG 1140 

CGTACTTCAT 1200 

GGTAATCCAC 1260 

ATGACGTCTG 1320 

TCATCATCAA 1380 

GAAACACCTT 1440 

TTGAAACCTT 1500 

TGGTCTACAA 1560 

ACAGATGAAT 1620 

CGAATCTTTT 1680 

CAAACACGAC 1740 

GGATGGAATT 1800 

CTTGGGCTAT I860 

CTCATCCAAA 1920 

ATGATTGCTT 1980 

GTAGTTAATT 2040 
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TATTGTAATT GTGTAGCTGC ATAGTCAGCA TAGTTATTCA TTGAACGTGC JKAATGAAGCA 

TAGTCGTGTG TGTGTACGAT TTCAACATTG ATGAATCTAG GGTTACCGAC TGCACCGACA 

CCCCAAGATA AGTAATCCGT TGGTGCTGTT TCGATTATAC GATCCCCATC AACAAATGCA 

TGTACGAATG CGTTTTGATA GTTATTTTTC ATATAACTAA TTTCACCATT TATCGTCGAA 

CGATCATTAG CTGTATCATG AACTACGATA CCTTCAGGAC GACCTACGCC GTTACGGTAT 

GCGTATTTAG GGAAGTAAGA TGTATAATCT TCTTCAATTT TAGGTGCTTT TAAGTTATTT 

TTACAAATGT AATCGTTAAT TGAAGAGTTT ACTTGTGGTT TATATTTTGG CAAACTCGTT 720 

TTTGGTGTTG CAGCAACTGA TCTTGGTTGT GCTGAAGCGC TAAAAGTAGT TACTTTAGGT 780 

GTCGCTTCAG TTTTAGCTTT AGGTGCTGAT GTAGTTGCAG CTTTAGGTGC TGCGGTTTTA 84 0 

TATTGCGTyT CAAGAGCTGC AGGTTTAGCA GCTGATTTAA TTAATTCTGG ATTAATTTGA 900 

20 TTTTCTGAAT TATCATCTTC ATCATCAACT AAACTATAAC CAGCATTTGT AACATTAGTG 960 

TTAGTTTTAG GTGCTGTAGT GCTTGTTGAC TTTGCAACAG GCTGCGTATT ATTTGTAGTC 

GCTGATTGAT TAGCACGAGT GTCACCATTT ACTTGTGCAG TATCAACTTT TTGACTTACT 

TGAGCATTGC CTGTTTTGTT ATTTGCTGTT TTTGGTTGGA CAATAGCAGG GTCTTGATAT 

ACTTGAGTGC CAGAAATGTT TTGCGTTGGA TTTTTTACCT CAGCTTTTGC TTGTTCAGTA 

GTTGCTTTAA CTTTATTACT ATCTAAAACG TTTTTATTAG TAGTTTGATC TTGTGTCGTC 

TCAGCTGCTT GAACTTGATG TGCAGTGACT GCTGAACCTA CAAGCGTTAA TGCAACCATT 

GAgGTAGTTT GTAATTGAAT TTTTTCGCCA TTCTATTTAT TACTCCyAAC ATTTATTAAT 13 8 0 

TATtACTAAC ATTATAGTAC CTGTnTTATA TACCTGTGCG TA 1422 

(2) INFORMATION FOR SEQ ID NO: 306: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6076 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1020 
1080 
1140 
1200 
1260 

13 2 o: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 60 

AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 120 

CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 180 

GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 240 
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TTCTACATGT GCGTTAAAAC CTTTTTTGAA TTGTTGGACX; CCATAGTCTT CTGATGACTC 27 SO 

TGAAAAGTCA CCGGTAATAC CATPiPJ^TT ATAGCGATCA ATATGATGCG CTTTAGCAAA 2820 

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 2880 

ACCACTAGAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2940 

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG TCGATTTTAC GCTTATTGCT 3000 

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 3060 

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AGCTGTTAAT TGTTGTTGCT TTAATTGTAA 3120 

CGTTTTTAAA TACTCGTITA AATCAATATA CGCCAACTTT AACATGGCGT GGTCATCGTA 3180 

TAACTTTTGC ATTTCTTCAA AGTATGGTAA CTCACGGAAT TTGAAACCXTT GCTTTTCCTC 3240 

AGCCATATGG AATAAGTCGA AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300 

GACACCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 3360 

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AGCCAACGGA TTTGGCTCAT 3420 

TCAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 34 80 

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT GCATTGCGTA AATTTTCAAT 3540 

TAAATATGGA TCTACAAGGA CATATAAACA ATTGTGTTTC TTTAAATATG ACGTTAATGC 3600 

TTTAAAGAAA AATGCTACTA ATGATTGATT TGTATAATCC ATCACTGGCC OGCGATGTGT 3 660 

ATAAAAATAT TTGAAAAATT TAAGTGTGCG TGCTTCTGTC AATAAGCATC CTGCAATCAC 3720 

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTTAAAT CAACTCTATG 3780 

ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3 821 

(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 



35 



4S 



so 



ss 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

GCGACACCAT TGTTTGCAGC AACTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60 

GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120 

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 180 

CTTCTTAAAT ATCCATGTGG ATcGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 240 
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AAAAACCTAA TGcTAAGATC AAAGCAATCA AAGGTAATGC TGAACAAAGT AGATCTGCAT ~ 960 

TTGCATTTTC TAAAAAAGCA GATGATGAAA CAGTTCAAAA ATTCAATGAT GGCTTGAAAA 1020 

AAATCGAGGA AAACGGTGAA TTAGCTAAAA TAGGTAAGAA ATGGTTTGGT CAAGATGTTT 1080 

CTAAATCTAA ATAGCGAACA ACAACATGCG CTAGATGCTG CAAAACAAGC TTTCGGACCT 1140 

ATGCTAGAAG GTTTGGTCAA ATATTCAATT CCTATTACAT TAGTTACATT TGTTTTAGGA 1200 

TTGATTATTG CATTATTTAC AGCATTAATG CGAATTTCAA CGAGTAAAAT TTTAAGAAGT 1260 

ATTTCACGTG TCTATGTATC TATTATTCGA GGAACACCAA TGATAGTACA ACTATTTATC 1320 

j5 ATATTTTATG GTATTCCAGA ATTAGGTAGA TTATTAACAA ATGACGCTGA CAACCAATGG 1380 

ACATTGGCAC CTGTAGTGGC TGCTATTATT GGTTTATCAT TAAATGTAGG TGCGTATGCT 144 0 

TCGGAAATTA TTCGTGGCGG TATTATTTCT ATACCGAAAG GACAAACAGA AGCTGCaTAT 1500 

TcCaTCGGTA TGACGTATGG TCAAACGATA CAACGTATCA TTTTACCGCA GGCAATTCGA 1560 

GTGTCGATTC CTGCACTAGG TAATACATTT TTAAGTTTAA TCAAAGATAC ATCATTATTA 1620 

GGATTTATTT TAGTGGCTGA AATGTTTAGA AAAGCTCAAG AAGTTGCGTC TACAACATAT 1680 

GAATATTTAA CAATTTATGT GTTAGTTGCG CTAATGTACT GGGTGGTATG CTTTATTATT 174 0 

TCAATTATCC AAGGTATCTA TGAATCTTAT ATTGAAAGAG GGTATCGCTC ATGATTCAAT 18 00 

TGAACAATAT CCATAAATCA TTTAATGATG TTGAAGTCAT CAAAGGTATT GATTTATCTG 1860 

TTGAACAAGG TGAGGTTGTA ACCTTAATCG GTCGATCTGG TTCAGGTAAA ACAACATTGT 1920 

TACGTATGAT TAATGCATTA GAAATTCCAA CTGAAGGTAC AGTTTATGTT AACGGCAAAA 1980 

35 CATATACATC TAAAGATAAA AAATCACAAA TAGAAGTTCG TAAACAGTCT GGTATGGTAT 2040 

TTCAAAGTTA TAACCTTTTT CCGCATAAGA CGGCATTAGA AAATGTAATG GAAGGTCTTA 2100 

TCACAGTTAA AAAGTTGAAA AAGGATGAGG CACXJTGGGAA ATCACTTGAG TTACTTGAGA 2160 

AAGTTGGTTT AACACATGTC AAAGATCAAC GTCCACATGC ATTATCAGGT GGTCAACAAC 2220 

AACGTGTTGC TATTGTCAAG AGCACTAGCA ATGAACCCTA AAGTGATGTT GTTTGATGAA 22 80 

CCAACATCTG CACTTGATCC TGAACTTGTG AATGATGTTT TAAAGGTTAT TAAAGATTTG 2340 

GCTAATGAAG GCATGACAAT GGTCATTGTG ACACATGAAA TGCGTTTTGC TAAAGAAGTA 2400 

TCTAATAACA TTGTATTTAT TCmTGAAGGC ATGATCGGAG AACAAGGGGC TCCAGAAGAG 24 60 

ATGTTCAATC GTCCGAAAAC AGAAGAATTA AGACGTTTCT TAAATGTTAT AAATGAAGAA 2520 

TAATCAAATA GAACCACGTA TCATGTTTTA GTATGGCGAT GAAGCCATAT ACATGATGCG 2580 

TGGTTCTTTG TTATGTTGTC ATAATCTTGG AGCGATATTT TAACGACGTT TATGATTTAA 264 0 

55 
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TTCAAAATTA 
AGTAAACGTC 
TTTAGGCTAT 
TGGAAATACA 
AAGTCGCAAC 
TAATGGTTGG 
CTTAGCTGGC 
GCCATTATTA 



ACTCAGGCTT TGTGnAACCT AATCCTAATG ACTATGACTT 
CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC 
CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC 
GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA 
AAACAATTAA TTATAAACAA GAAAGTTACA GAACAACATT 
AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA 
GAAGAGATAG CTTCCACCCA ACATATGGTA ATGAACTCTT 
GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT 
ACTTCAATCC AGAATTTTTA AGCGTACTAT CACACAGAC 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 
GCAAAAACTT TTCTCCAACT ATGTCAGACT ATAATGGCAT GAACAAAATT 
ATCAAATTAA AGTTGATACG ATGTTGCATG GTTATCACGC TGGATTTTTA 
TGATTACAGT TGTTAGTTTC TTTTGTTCAT TTATGTTACA AGGTAAGAAA 
ATTCTCGTCA ATAAATATAA GTTGCTAGCT ATATAAAGCT TTTTAGCAAA 
ATTGACTTAT CCGGCATTTC AGATTAAAAT ATTTTTATTC CGATTAGAAT 
AGGAGATATA TTCTATGAAA AGACTTTTAT TTGTGATGAT AGCTTTCGTT 
CTGCATGCGG AAACAATTCG TCGAAAGACA AGGAAGCTAG TAAAGATAGC 
ATGTTGGGAC TGAGGGGACT TATGCACCAT TTAGTTTCCA CGATAAAGAT 
CTGGTTACGA TATTGATGTT ATTAAAGCAG TGGCTAAAGA AGAAGGTTTA 
TTAATGAAAC TTCTTGGGAT TCTATGTTTG CAGGTTTAQA CGCAGGGCGT 
TCGCGAACCA AGTAGGTATT AATCCTGATA GAGAAAAGAA ATATAAATTT 
ACACATTCTC AAGTGCTGTT TTAGTTATTC GTGAAAATGA AAAAGATATT 
ATGATGTTAA AGGTAAGAAG TTAGCACAAA CATTCACATC TAATTATGGT 
AGGATAAAGG TGCTGATATT ACAAAAGTTG ATGGCTTTAA CCAATCAATG 



TATTGGGGAG 
GTTGATTATG 
ACATTTGGTG 
GCTTTTTCTG 
ACAAATTATA 
GGACCTTATG 
AGACAAAGCA 
TCTAGAAGTA 



GATATGATGA 

TTTGCATTAT 

AAAGAAGTTG 

AGTTCAACAT 

AATAAGAATA 

TTCATATTGG 

AAGACAATTA 

GGTAAATTAA 

AAACTTAAAT 

TTTGATGTAA 

TCTAAGCCTT 

AAAGATTTTG 

AAATTAGCTA 

GATTTATTAT 



2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3159 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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AAAAATTAAA AGCATCACAC CTAAGTAAAC AGTTCaATCA TCTTAAAAAA TCCTGGGACA 
CTTCATACTT GTCTCAGGAT TTTTTmACAA ATTGAATCAG CCTCATAACA TTAAATTATT 
TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG TATCGAmCCA 
TAGTAGATAC ACAAATAATA CAAATGAAAC AATTTAACTT GAAAGCTTAa ATAAATATTA 
TCAAGTTAAT AAACAATTAA TTTTTAGATG GATTCATCAA AAATCGTAAA AAAGCACAAT 
TTGTATTTTA CAAACATTAA TTAAAAAAGA AAGCAAGACA TTCGTCCAAT CGGTTACCTT 
AAATTGTTTA CAACTGTCAA CAATACCAAG GTTTTATTAA CTATATTTCT CACAAAATTA 
,5 GCTTTTAGCA TTCCAAACAA AAAAGGTTAA ATCGAACGGA ATTATGGCAT TTTTAACTTA 

ATTGTAAAAA AAGTTGATAA TGGTCAATTG TTAATGAACA GTTAATTATA ATAACGCCCA 
AAATATATTA TTATTTAATT AAGTTAAATA AAATTATAGA AAGAAAGTGA AACTTATGCT 
20 TAAAAATAAA ATATTAACTA CAACTTTATC TGTGAGCTTA CTTGCCCCTC TTGCCAJVTCC 

GTTATTAGAA AATGctAAAG CTGCTAACGA TACTGAAGAC ATCGGTAAAG GAAGCGATAT 
AGAAATTATC AAAAGGACAG AAGATAAAAC AAGTAATAAA TGGGGCGTGA CTCAAAATAT 
TCAATTTGAT TTTGTAAAGG ATAAAAAATA TAACAAAGAT GCTTTCATAT TAAAGATCCA 
AGGATTCATT AGCTCTAGAA CAACATATTA CAACTATAAA AAAACTAATC ATGTTAAAGC 
TATGCGATGG CCATTCCAAT aTaATATTGG TTTAAAAACA AATGATAAAT ATCTTTCTTT 
AATTAATTAT TTACCTAAAA ATAAAATTGA ATCTACAAAC GTGAGTCAGA CATTAGGATA 
CAATATCGGT GGTAATTTCC AATCAGCCCC ATCACTCGGT GGTAATGGAT CATTTAACTA 
35 TTCTAAATCG ATTAGCTATA CACAACAAAA TTATGTAAGT GAAGTAGAAC AACAAAACTC 

AAAAAGTGTT TTATGGGGCG TCAAAGCGAA TTCATTCGCC ACTGAATCAG GTCAAAAATC 
AGCCTTTGAT AGCGATTTAT TTGTAGGCTA CAAACCTCAT AGTAAAGATC CTAGAGATTA 
TTTCGTTCCA GACAGTGAGT TACCACCTCT TGTACAAAGT GGAITTAACC CTTCATTTAT 
CGCCACAGTA TCTCATGAAA AAGGTTCAAG CGATACAAGC GAATTTCAAA TTACTTACGG 
AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATCGCA ACAGTTATTT 
AGACGGACAT AGAGTCCATA ATGCATTTGT AAATAGAAAC TATACTGTGA AATACGAGGT 
CAATTGGAAG ACTCATGAAA TCAAGGTGAA AGGACAGAAT TGATATGAAA ATGAATAAAT 
5^ TAGTCAAATC ATCCGTTGCT ACATCTATGG CATTATTATT ACTTTCTGGT ACTGCTAATG 

CTGAAGGTAA AATAACACCA GTCAGCGTAA MAAAGTCGA TGACAAAGTT ACTTTATACA 
AAACAACAGC CACAGCAGAT TCTGATAAAT TTAAAATTTC ACAGATTTTA ACATTTAATT 
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AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 6660 

AGGATCTCCa GATGCGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 6720 

CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 6780 

TAAAAATGCA AAATTTCCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 6 84 0 

ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTTAATC AAGGCAGAGG AAACTTTTGG 6900 

TATTCGTCTT ACTACTATTC ATAATTTACA ITTTCTGCTA AAATTAATGG AAGATATAAG 6960 

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 7020 

ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080 

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 7140 

ACAAAAACTG TGCGAAACA "^^^^ 
^0 (2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



75 



25 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

TCCATTTATA GAAATTTCTA PJiGCAGAnl^ TAAGATAGAA GATATCGGCC AAGGTGCAGA 60 

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA 120 

ATTTGATTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG 180 

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT 240 

GATi^GGCCA TTTCAATATA ATATCAGTTT GArAACGAAA GACTCTAATG TTGATTTAAT 300 

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA 360 

TATCGGCGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC 420 

TAAAACAATT AGTTATAATC AAAAAAACTA TGTTACTGAA GTAGAAAGTC AGAACTCTAA 480 

AGGTGTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC 54 0 

ATATGATCAA TACTTATTTG CACAAGACCC AACTGGTCCA GCAGCACGAG ACTATTTCGT 600 

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC 66 0 

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAGTTT GAAATCACTT ACGGCAGAAA 720 

CATGGATGCT ACATATGCTT ACGTGACAAG ACATCGTTTA GCCGTTGATA GAAAACATGA 780 
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GGCAGAAAAG 
TAACGTGAAT 
TTTAAAAGAT 
ACATTTACAT 
CGATACGCGA 
TGAAATGTTA 
TAAGCGTATT 
CATAAAAGAA 
AGAAAGATTA 
TGATCCAGAT 
AGCAGGATTA 
CGCATTTGTT 
GAATGACCAC 
AAATGATACT 
CTTGAAACAA 
TTTATTTATC 
CCAAAATCAA 
GCTTATAAAA 
ATTATATAAA 
AACTTGTAAA 
AACACCTATG 
GTTAAGACAA 
CGGAAATGAT 
GATTCTTACA 
AGAAGAAGGC 
GAAATCAATG 
TCCACCGATG 
GGCGAAAAGA 
ACAAGGTGGC 



CAACACCATT 
ATTGAAGAAT 
CGTGATCATA 
TTCAAAGATA 
GTAATGCCAG 
ATGTtAACTC 
AAAGTTGGTA 
ATGGATCAAG 
GATGAATTAG 
CGTTATCAAA 
CATTTTACTG 
ACATTACATG 
GAAATGCATA 
AAGcCAAAGG 
TTCGACGCGA 
CAGGATTTGA 
CATTAGTTAT 
CGGCAGTAAA 
AAGAATGTGA 
CAATCAGGTG 
TTTATGCCAG 
ATTGAAGCAA 
ATTATCAAAC 
GATTCAGGCG 
GTGGAATTTA 
CAAATTCAAA 
CCTGCTGAAT 
TGTCTAGATG 
GAATATGAAG 



AGCTTATGAA 
TTGACTATGA 
GTCGTTTATT 
TCATTGAGTA 
CTAGACTTTT 
AAATTGAAGG 
ATAAATTGAA 
GTGGACGCAT 
GGGAAATGCC 
CAGTTTACGC 
ATGAGTTATT 
TTGGGTTAGG 
GTGAATATTA 
ACATCGCATT 
TCATGATAAA 
TTTTAAAGCA 
GCTAGTATCA 
TTTAGAATAT 
GGATTTTGAA 
CGCGTTTAGG 
TTGGTACTAA 
AAATCATTTT 
ACGCTX3GGGG 
GTTTCCAAGT 
GACATCATAC 
ATGATTTAGG 
ATGATTATGT 
CACACCAAAG 
ATTTAAGAGA 



CATTTTGCAA 
CTTACCAGAA 
AGTCATGGAT 
TTTTAGACCT 
TGGTTTAAAA 
TAATGATTGG 
TTTTGGTAAT 
CATGCGTTTA 
ACTGCCACCA 
TAAAGAAAGT 
AATTGAAATT 
TACGTTTAGA 
TCAAATGACm 
ATATCAGTTG 
TTTGTTGAAA 
ATTGATGGCC 
GCGTTTAGTA 
AGATTCTTTA 
TATGCCTGCA 
TATCGTGCAC 
AGCAACCGTT 
GGGCAACACA 
ATTACATAAA 
GTTTAGTTTA 
TAATGGGTCT 
ATCTGATATT 
AAAAAAATCT 
ACCTGAAGAT 
ACAAAGTGCA 



AGTCGAATGA 
TCATTAATTG 
AGAGAAACTG 
GGTGATACAT 
GAAGAAACTG 
GAAGTCTTAC 
GGCAAAATTA 
CATTATGAAG 
TACATCAAAG 
GGTTCAGCGG 
AAAAATAAAG 
CCGGTGAGCG 
CAAGAAACAG 
GTACAACTTC 
CGAGTGGCTG 
AGATTACTAA 
GTCGTGAAAA 
GTTTTGGCGA 
GTAACATACG 
ACACCACACG 
AAAACAATGA 
TATCATTTGT 
TTCATGAATT 
AGTAATTTAC 
AAATTATTTT 
ATGATGGCAT 
ATTGAACGTA 
CAAGCATTGT 
AAGGATTTAG 



GGAGAGAGAA 
CTCAAACGCC 
GTGAAATGAA 
TAGTGCTTAA 
GTGCAAAAGT 
TGAAACCAGC 
TAGCTGAATG 
GTATTTTACA 
AACGTTTAGA 
CAGCACCAAC 
GTGTTAATAT 
TAGACGATGT 
CTGATTTATT 
AACACGTACA 
GACTAATATA 
TTTTCATTTA 
TGTTCTGAAT 
TGCAATGTTA 
AACACATTAA 
GTTCATTTGA 
GTCCAGAAGA 
GGTTACAACC 
GGGATGGTCC 
GTAAAATTAC 
TGAGTCCTGA 
TTGATGAATG 
CAACACGTTG 
TCGGCATTAT 
TAGAATTAGA 



4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

606O 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 



55 



BNSOOaO: <EP_07e6519A2_l_> 



1136 



TO 



IS 



20 



EP 0 786 519 A2 

TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 30S0 

GATTCATCAA AGTATTCCAA TGGPAGPJiPA AGCAACAATA ACATTATCAC TGAATGCTAA 3120 

ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 3180 

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 3240 

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GGTGTTGGTT 3300 

ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3360 

TTCATACATC TTTAATTCTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420 

AAGAGAAAGA TATGTTCTTG AGTTTAATTA AAGTTACTGG TATTGGTCCG AAATCAGCTT 3480 

TAGCTATTTT AGOGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 3540 

ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 3G00 

ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 3660 

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 3720 

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 3780 

25 CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTTGT ATCTTAATTT TAAATAGATT 384 0 

AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3900 

ACTCATTTCG AATTCTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3960 

^° ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4020 

CATCTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 4080 

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 4140 

TTCGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200 

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATTT CTTTTTAGAT 4260 

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 4320 

TTGGTAGGTG CAACAACGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 4380 

GTCCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 4440 

GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACX3TTCT 4500 

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4560 

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4620 

SO GATCAACACG GACTAGATTA CATTGATCAT AAAATGATGA ACTGTATTAT TAAGCAGTAT 4680 

AATGGCGGAC CTGTTGGTTT AGATACGATT GCCX5TAACAA TTGGTGAAGA ACGTATTACA 4740 
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AGACGTTATA CTAAATGTGC AcTGTATAAG AGCCCCTAAT CACTAAACTA TAAGGGGGAC 12 G 

IKAAGGMTAC AGTTGCAGCG TTTAAAGAAT AAACTGTACC ACAATTGGTG CTGAGAAATA 132 
TAAGTATTTT AAAGCAAAGA TTTATAAAAG TAACTGCATA AGAGCCCCTA ATTATTTACA 13 8 

ATATATAAGG GGCTCTAATA TGCTATAATT ATTGGGAAAA TGAAAATTAT ATGTAAAAGA 144 
GGTGAGATAT ATGTTTGTCG ATCAAGTCAA AATATCTCTT AAAGCCGGTG ATGGTGGTAA ISOi 
TGGTATTACC GCATACAGAA GAGAAAAATA TGTACCATTT GGTGGACCT^G CTGGCGGTGA 15S, 
CGGTGGTAAA GGTGCTTCAG TCGTATTTGA AGTGGATGAA GGTTTAAGAA CGTTATTAGA 162( 
,5 TTTTAGATAT CAACGTCATT TTAAAGCAAG CAAAGGTGAA AATGGCCAAA GTAGTAATAT 16 8 C 

GCATGGTAAA AATGCGGAAG ATTTAGTATT AAAAGTTCCA CCTGGTACAA TTATTAAAAA 174 c 

TGTTGAAACA GACGAAgTGT TAGCAGATCT TGTTGAAGAT GGTCAAAGAG CTGTAGTAGC 180C 
GAAGGGCGGT CGAGGTGGCC GAGGTAATTC ACGTTTTGCA ACACCTAGAA ACCCTSCACC 18fiC 
TGACTTCAGT GAAAAAGGTG AACCAGGTGA GGAATTAGAT GTATCTTTAG AATTGAAATT 1920 

ATTAGCTGAT GTAGGATTAG TAGGTTTCCC TAGTGTGGGT AAATCGACTT TATTATCTAT 1980 

CGTTTCAAAA GCTAAGCCTA AAATTGGGGC ATATCATTTT ACAACGATTA AACCAAATCT 2040 

AGGTGTTGTT TCAACGCCTG ATCAACGTAG TTTTGTTATG GCAGATTTAC CAGGTrTAAT 2100 

TGAAGGTGCA TCTGATGGCG TTGGATTAGG ACATCAATTT TTAAGACATG TAGAGAGAAC 2160 

AAAAGTTATT GTTCACATGA TTGATATGAG CGGTTCTGAA GGTAGAGAAC CTATTGAAGA 222 0 

TTATAAAGTC ATTAATCAAG AATTAGCTGC GTACGAGCAA CGTTTAGAAG ATAGACCTCA 2280 

AATCGTAGTA GCTAACAAGA TGGATTTACC TGAATCACAA GATAATTTAA ACTTGTTTAA 2340 

AGAAGAAATT GGCGAAGATG TGCCAGTTAT TCCAGTTTCA ACAATAACGC GTGATAATAT 2400 

TGATCAATTA TTATATGCAA TAGCAGATAA ATTAGAAGAA TATAAAGATG TTGACTTCAC 24 60 

AGTTGAAGAA GAGGAGTCAG TTGGCATTAA CCGAGTATTA TATAAACATA CACCGTCACA 2520 

AGATAAATTT ACAATTTCAA GAGATGATGA TGGTGCTTAT GTCGTAAGTG GTAATCCTAT 2580 

TGAAAGAATG TTTAAAATGA CTGACTTTAA CAGTGATCCA GCAGTACGTC GATTTGCTCX; 264 0 

45 TCAAATGCGT TCGATGGGTA TTGATGATGC GCTTAGAGAA CGTGGTTGTA AAAATCGTCA 2700 

TATCGTTAGA ATTCTTGGCG GAGAATTTGA ATTCGTTGAA TAGGAGCGAA ACATGATGGA 2760 

CAATAAAGAT TATAAAAAGT TTTATTTAAT TAGAGAAGAT GTCTTGCCTG AATCCGTGGT 2820 

50 

TAAAACATTG AAGATTAAAG ATGCCTTAAA AAGTGATCCG ACATTGTCCA TTTATGATGC 2880 

CGTTAAACAG ITTGATCTAT CTAGAAGTGC TTTTTATAAA TATAGAGAAA CX3ATATTTCC 294 0 
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CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3 900 

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3960 

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCAC7VTGT GCACCATGCT TGCCACCTCG 4 020 
GAACCTATT 

(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



4029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

GATGCAATAG TTGAGTAGTT ATAAGTAGCT ACATACAACC GCTCAAATAT AGGTTCAAGA 60 

ACATTTTTAA TGAAGAAAGC GATTTTCCCm CAGAGTGCAA ACGCTTGACT CAATAACGAA 120 

ATGTCACCTA TGTATGGCGT GACTTATTAT ACAGGaGGTG CAAAGTATGT TTGCTATTAT 180 

25 TGAAACAGGT GGAAAACAAA TCAAAGTAGA AGAAGGTCAA GAAATCTTCG TTGAAAAATT 240 

AGACGTAAAC GAAGGAGATA CTTTTACATT TGATAAAGTA TTATTTGTAG GTGGAGATTC 3 00 

AGTTAAAGTT GGAGCGCCAA CAGTTGAAGG TGCAACAGTT ACTGCTACTG TTAATAAACA 360 

AGGTCGCGGT AAAAAAATCA CTGTATTCAC ATACAAACGT CGTAAAAATT CAAAACGTAA 420 

AAAAGGCCAT CGTCAACCAT ACACTAAATT AACAATCGAT AAAATCAACG CGTAATTATT 4 80 

ATGATTACTG TTGATATTAC AGTTAATGAT GAAGGCAAAG TAACAGACGT TATTATGGAT 540 

GGCCATGCTG aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 600 

TTG-FrrGGTA GTGTTAATGC GATTATAGGA TTGACATCTG AGAGACCAGA TATCAATTAT 660 

GACGACAATG GTGGTCATTT TCATATAAGA AGCGTTGATA CAAACAACGA TGAAGCGCAA 720 

CTAATTCTTC AAACAATGCT TGTGTCITTA CAAACTATTG AAGAAGAATA TAATGAGAAT 780 

ATTAGATTAA ATTATAAGTG AGGTGCATTC CGATGTTAAA ATTAAACTTA CAATTCTTCG 840 

CATCTAAAAA AGGGGTAAGT TCTACAAAAA ACGGAOGTGA CTCTGAATCA AAACGCTTAG 900 

GTGCTAAACG TGCTGACGGT CAATTCGTAA CAGGTGGTTC AATTTTATAT CGCCAACGTG 960 

GTACTAAAAT TTACCCTGGT GAAAATGTAG GTCGTGGTGG CGATGATACA TTATTCGCTA 1020 

SO AAATCGACGG CGTTGTTAAA TTCGAACGTA AAGGTCGCGA CAAAAAACAA GTTTCTGTAT 1080 

ATGCAGTAGC TGAATAATTT TGTCTAGTTA ACACCAGAAG TGAATCTTCT GGTGTTtTTT 1140 
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CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACACKSAAT ATTTAATTCG TGCAAAGATT 
TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 

G-rrrGTCACC gtcattattc aaacggctta aattaacttt attataatga ttcgttaacg 

GACTTTGATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 

cagatttatg tacatttctt gccattaaga tatcgccttt tttctctgaa aaagactgga 

TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA TAATCTGAAT 
GTTTCTCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 
AATCCAGGTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 

ATArrm-AT gacccggaac atgcaaagaa atcgcttctt cttgatttaa actttctaat 

TTATTTAAAA TAGGTTGCTT CATGATATAC GCTTCCTTTA TTTACACTGT TriGGAATTA 
GTTACnrCA AAAGTATTAA TTATATAGTA ACACTTCTrr GACAAAAGTT AGTCTTACTr 
ATGCAATAGC TTCTCTATTG TATAATAATT AATTTCCTTT a-rGTACrrCG ATTTAAAAGA 
TATTAGACAT AAAATCTAAA AACAGCAGTA AGATCATTTA TGATTAAAAA CTATCTTACT 
GCTCTTCACT TTTTATAATA CTTCTGAATXS TCTTCACTTA TACTTCTAGT CACAGATTTA 
AATAATCAAA AGTGCACATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 

TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCGCT 

30 

AAGAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 
ACTCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCX3TCAGA TTCAAACGTT 
TTCACTTCGC CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTCACGTT TTAGACATAA 
AAAAAGAGAC cTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 
TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTOAC AATCGCTTGC TTCTTTCCTC 
TCCTTCGGCT CTCGCTTACT CATTTAGCTC TACTAAACTC GTTGCGCTCT TrrCTCGTTT 
CGTCAGATTC AAACGTTTTC ACTTCGCCAA GCCATmrC TTTGTSTTTA CITTTTATTT 
TGACGTTTTA GrCATAAAAA AAAGAGACCT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 
45 TTTTTGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GwCGCTAAgA 

aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 
ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTrC GTCAGATTCA AACGTITrCA 
^ CTTCGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTrT GACGTTrTAG ACATAAAAAA 

AAGAGACCTT GCGGTCTCAA TGCGGCTCAT CGCATCCATT TnTGCCTGG CAACGTTCTA 
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TTATTAATAC 
TGGAATGCAT 
TATGGAATGT 
CTATTACCAC 
CCACATAAGA 
GCTCTAAAGT 
ATCGCTATAA 
AAATATTTGA 
TTAACGCTTT 
TTTTCGTGAA 
TACGTTCGCG 
CATTTATTGC 
CTTGATAAGC 
AAGCTGGTAT 
CaGTtCTAAT 
GGAACACCAC 
ACTTCATTAA 
GACATTTTAT 
AGTTTCCAGA 
AATAATAGGA 
TTTTcAAATC 
TTAAAAGTTG 
TTTTACGTAA 
CTAATGCTTG 
GAATATCATG 
GAAGCATTTC 
ACACGGTACT 
AATACGAAGG 
GTGCATTTTT 



ACTATTTTAA 
CAACTGGCAT 
ACGAATCCGC 
ACGTTTGATT 
ATGTTGTATT 
TATTTTTAAC 
TCATTTTCAT 
TGATAGTTTG 
TGAACCGTTG 
cTTTAAATCT 
ACCTACTTCA 
AAATTCGTTT 
TAATGAACTA 
GACCTTTAAT 
GTCCATATCA 
CTGGTTCTCT 
TTACAGTTGT 
TCATCCTCAA 
TATTCATTTA 
ATcCCTGGcG 
ACACCAAGTA 
TGTTTGTTTA 
AAGCGAATCA 
ATAGTCGTCC 
ACCTGTAAAA 
AAAACCCTTC 
ATCATATGTT 
ACTAGATGTT 
ATGAATATAA 



GTATCTCTAA 
AACAAATACA 
ACTGCCTCCC 
AATCACAGAC 
ACCCGCTCTT 
AAGTTGATCT 
TTTATAACAC 
ATACGTGTCT 
TGATTCATTA 
TCTTGATCTA 
GCACTAACAT 
AATGCTCTTA 
TCGATATAGC 
ACAAGATGTT 
TTGCCcTTCT 
AGTCATAATG 
TTTTCCAGAG 
CTAAAATTTT 
CCAATTCTAT 
GATACGGGAC 
ACATATTCAA 
ACTTTAGAAA 
AATAAATACG 
GCTAATTCTA 
CCTTCATATT 
TTCTCCAAAC 
TTATAGAACT 
TGGAAGTAGC 
AGTACCGAGC 



TTGATGATTG 
GTAGCACCGC 
ATAGGTGTAA 
AATATTTCAT 
AAT^AACCCAC 
GCAAGTTCCT 
CTCTTCTAAC 
TCAACAACAT 
TGAATGATTT 
ATCTATTTTG 
TTAAATAAAT 
CTTCTTCAAC 
GATCACACAA 
CTCTTCTAGA 
AATACAATTT 
ACATCATAAT 
CCTTCTGGGC 
ATTATTTTTA 
CATATTTTCA 
AATATGTCGC 
AGCGTTTTGG 
CTGATTTTTT 
TATCATCATG 
AATAGATATG 
TTATCAGCAA 
ATTCGATTAA 
CGGCAGCTGA 
TTAGATATTC 
CCATCX3TTAA 



TATTATAGAA 
CAACTTCAAC 
TAGGTGAAAC 
CTACACGGTC 
CTGTTGTTGC 
GACTATCTTG 
AATTATATCA 
TTTCAAGAGG 
CTTGGTAACC 
ATCTCTTGAA 
CGTCAAGTCT 
GCCAATCCCT 
CACAACCTTA 
TGCAGCAAAT 
TACGTATTTC 
CTTTTACTAA 
CCTCAAAAGT 
ATTCCTTCAA 
GTTATTGTTT 
GCCAGAACTT 
TTTATAGTTA 
CXX5TAAAATC 
ATGCCATAAC 
TGCATTCATA 
CTTCAACGGA 
TTGCGCTCTC 
CTCTAAACTA 
TATAATAGTT 
AGCTGGTAAC 



TTAAAATTGA 300 

TTCAACTGGA 360 

CAACTGTTCT 420 

ATCATTGACA 4 80 

CAATTTTGTT 54 0 

ATCTTGTACG 600 

TATCTTTTCT 660 

TTGATCTGCA 720 

TTCAATTACT 780 

TTTTTAATAA 84 0 

GGaTATAATC 900 

CTAGCATAAC 960 

CCTTCTTTTA 1020 

AACATTGCTT 1 0 80 

TTCACCAGTA 114 0 

TCTATGGTAA 1200 

TATAAAAGCT 1260 

CTATCATTCC 1320 

CTCCTTTQAA 1380 

TACCTTTgCC 1440 

CCTTCAGTyG 1500 

ATATCTTCAA 1560 

GGCAATATCG 1620 

AACCAATTTT 1680 

TCATCAACTT 1740 

TTATCAATVAA 1800 

GCCATAATCA I860 

TCTCTATAAG 1920 

GTTTTATGAA 1980 
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TGTATCAACT GTCAAACGAT CACGTAATAA ATAGACGATT AACATTGcTG CTAAAGCACA 
AAGTGACTCA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAGCATTCAT 
AATAAATAAT ACTGGAATGA TAATTGCACC TTGTAAAATG GCCATAATTG TAGCACCACG 
ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 
TGATGCCATT GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 
AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 
AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 
AAAGTTATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 
aAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 
TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 
20 AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 
ATTACTGATT GCAGTACCCA AAGCTGcACC AACAACGTTT aAATCAAAAC CAAAAATTAA 
25 AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 
ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT AACGTTTCAC CTCTCGCCCC 
TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 
TGCAATACCA CCATAAATAG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCX;C 
ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 
T 

(2) Information for seq id no: 301 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
^ (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60 

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTCCTT TTATATAAAA AGGTTCCCAA 120 

AGAGCAACTA ATTACTGCAA CAATAGCGAC CATGATTTGT GATTATTTTT AGAACCACAA 180 
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TGCTGTTTCC 


ATCGCATATG GATTTAGTAA GTTCTTTATG GGTACTGTAA gTGATCGGAG 


300 




CAATGCTCGG 


ATATTCTTAG 


TTCTTGGATT 


AGCACTCACT 


GCTATCGTCA 


ATTTGTTAAT 


360 


5 


GGGATTTGTA 


CCGTTCTTTA 


CATCAGGTAT 


CGGTATTATG 


TTTGTCCTAT 


TATTCTTAAA 


420 




TGGATGGTTT 


CAAGGTATGG 


GCTGGCCACC 


TTCAGGCCGT 


GTTCTCGTTC 


ACTGGTTTAG 


480 




TGTAAGTGAA 


CGCGGAAGTA 


AGACTGCCCT 


TTGGAACGTT 


GCGCATAATG 


TTGGTGGAGG 


540 


10 


TATTATGGCA 


CCTATTGCTG 


CTTGGGGTAT 


TACAACAACT 


GCATTTATCA 


ACTTTGGTTA 


^00 




TTTAAAAGGT 


TTCGAAGGTG 


TATTCATTTA 


CCCTGCACTC 


TTAGCACTTA 


TCATTGCCGC 


660 


15 


AATTTCATAC 


GTATTGATTA 


GAGACACACC 


TCAATCTCAA 


GGTTTACCTC 


CAATCGAAAT 


720 


TTATAAAAAT 


GACTTTGCTA 


CAAGCGATAA 


GAAAACATTA 


GAAACAGAAT 


TAACTACAAA 


780 




AGAAATTTTA 


TTTAAATATG 


TACTGAACAA 


TAAATGGGTA 


TGGGCAATTG 


CCTTTGCAAA 


840 


20 


TATATTTGTT 


TATTTCGTGC 


GTTATGGTGT 


ACTTGATTGG 


GCGCCAGTCT 


ACTTAAGTGA 


900 




AGAAAAACAT 


TTCGACTTAA 


AAGCATCAGG 


TTGGGCATAC 


TTCTTATACG 


AATGGGCTGG 


960 




AATTCCTGGT 


ACATTATTAT 


GTGGTTACAT 


TTCTGATAAA 


TTATTCAAAG 


GTCGTCGTGG 


1020 


25 


ACCTGCAGGT 


TTCTTCTTTA 


TGTTAGGTGT 


CACAGTATTT 


GTATTAATTT 


ATTGGTTAAA 


1080 




TCCTCCAGGC 


AATGCTTGGT 


TAGACAATGT 


CTCATTAATT 


GCCATTGGTT 


TCTTAATATA 


1140 




TGGACCAGTT 


ATGTTAATTG 


GTTTACAAGC 


ATTAGATTAT 


GTACCTAAAA 


AAGCAGCTGG 


1200 


30 


CACAgcAGCT 


GGATTAACAG 


GATTATTTGG 


TTATCTGTTT 


GGTGCTGTAA 


TGGCCAACAT 


1260 




CGTCTTAGGT 


GCTGTAGTTG 


ATAAATTCGG 


ATGGGATGTC 


GGTTTTATTT 


TATTAACAGC 


1320 




AATTAGTGTG 


TTTGCAATGT 


TGAGCTTTAT 


CCTCACTTGG 


AATAAAGTAG 


GACAAGAAAC 


1380 


35 


CGTTCATCAT 


TAAATGATAA 


AAAATAAAGT 


CATATGGTTA 


TCTTATCGAA 


AGATGATATA 


1440 




TTCATCTCTT ATAAGTTCAA 


CCATATGACT 


TTTTATTAGT 


ATTCAAAAAA 


ATATTTACAT 


1500 


40 


TGCCACTTTT 


GTGTTTGCCC 


TGCTGTTTTA 


TTCAATTGAT 


TACACCACTT 


AGGATAAACT 


1560 


CTAAAAGCCA 


TTTTCCCTTG 


ATACTTAGAT 


GAAGCTAAGA 


TACcrrrrrc 


TACCAATAAT 


1620 




TCCCTAGGAA 


ATAGAAAGTA 


GCCATTTAAT 


TCATCATCGA 


TAACAGCAAC 


AATCAGGTAA 


1680 


45 


TCAGCAAACT 


CTTCAATTTT 


GTATGGCCGA 


TTATAATTAT 


CtTCGTCTTT 


TGTCCAACAT 


1740 




GTCACGAAAT 


ATCCCGATTT 


AGTCGGTGTT 


TTCTTAGCTA 


ACCTACTTTG 


ATATGTTTCC 


1800 




TCTTTAAAGC TAAATGTTAA TGCyTCGTAA 


TCTTGATTAT 


ATTTTTCTTC 


AGTTAAATCT 


1860 


50 


TTAACTTCTG 


ATTCTTCACT 


AAAAATATTT 


TTCAACAGTA 


TTTTAGATTT 


ACACATATGC 
AAAATACACT 


1920 
1980 




CAATCTCACT 


TTA'rrrrrrc 


ATAATCGTAT 


CATATATTTA 


'rrrrmcGA 
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GTATTCGTGG TTGTGACGTA TTTATTATTC AACCAACATC ATATCCTGTG AATCTACATT 3G0 

TAATGGAATT ATTAATTATG ATTGATGCTT GTAAACGTGC TTCTGCAGCA ACAATCAATA 366 

TTGTAGTGCC ATATTATGGA TATGCAAGAC AAGATAGAAA AGCCCGTAGC CGTGAGCCAA 372 

TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGct GG CGCAACTCGT ATGATTGCGT 3791 

TAGACTTACA TGCACCACAA ATTCAAGGAT TCTTTGATAT TCCAATTGAC CACTTAATGG 3 841 

GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA TATTAACCCA GAAGAATGTG 3 90 1 

TCGTTGTTcA CCAGACCATG GCGGsnTTAC ACGTGCACGT AAATTAGCTG ACATTTTAAA 3 96< 

,5 AACTCCAATT GCAATTATAG ATAAACGTCG TCCTAGACCA AATGTTGCTG AAGTGATGAA 4 02C 

CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT GACGATATTA TTGATACAGC 408C 

AGGTACAATC ACTTTAGCTG CACAAGCATT AAAAGATAAA GGTGCTAAAG AAGTATATGC 414C 

TTGTTGTACA CACCCTGTTT TATCAGGACC GGCTAAAGAA CGTATCGAAA ATTCTGCTAT 42OC 

AAAAGAATTA ATCGTAACAA ACTCAATTCA TTTAGATGAA GATCGCAAAC CATCTAACAC 426C 

TAAAGAATTA TCTGTTGCTG GTTTAATCGC ACAAGCTATC ATTCGTGTAT ACGAAAGAGA 4 320 

ATCAGTTAGC GTATTATTTG ACTAATATTT AAAAGGCGTT TGACGAACAT ATTCCAAACG 4380 

TGTATAATAG TTTCGTTCGT GATTATACGA ATAAATAAAC ACTTGCAAGC AACGATGATG 4440 

TTGATGGGTA AGTGAGGTGC TCGTTTTGAG CAAAAATGAA AGGTGGAAAT GAGAATGGCT 4 500 

TCATTAAAGT CAATCATCCG TCAAGGTAAA CAAACACGTT CAGATCTTAA ACAATTAAGA 4560 

AAATCTGGTA AAGTACCAGC AGTAGTATAC GGTTACGGTA CTAAAAACGT GTCAGTTAAA 4 620 

GTTGATGAAG TAGAATTCAT CAAAGTTATC CGTGAAGTAG GTCGTAACGG TGTTATCGAA 4630 
TTAGGCX3TTG QTTCTAAAAC TATCAAAGTT ATGGTTGC 
(2) /information for SEQ id NO: 300: 

40 < i ^ SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA €0 
TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 120 
ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 
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TGaTGCTCXST 


ATTTTTGAAG 


TAAGAAAAAA 


GTTGTTTTTA 


AAATTACAAC GAATTAAAAA 


1800 




CAATGCCTTT 


TATATGTTGA 


AAGAGTATTG 


CaGATTAAAT 


TaTAATAATG ACGAaGgTAA 


1860 


5 


AATTTAATGG 


GGGTTAATGT 


TCATGCGAAG 


ACACGCGATA 


ATTTTGGCAG CAGGTAAAGG 


1920 




CACAAGAATG 


AAATCTAAAA 


AGTATAAAGT 


GCTACACGAG 


GTTGCTGGGA AACCTATGGT 


1980 




CGAACATGTA 


TTGGAAAGTG 


TGAAAGGCTC 


TGGTGTCGAT 


CAAGTTGTAA CCATCGTAGG 


2040 


10 


ACATGGTGCT 


GAAAGTGTAA 


AAGGACATTT 


AGGCGAGCGT 


TCTTTATACA GTTTTCAAGA 


2100 




GGAACAACTC 


GGTACTGCGC 


ATGCaCGCAA 


ATGGCGAAAT 


CACACTTAGA AGACAAGGAA 


2160 


IS 


GGTACGACAA 


TCGTTGTATG 


TGGTGACACA 


CCGCTCATCA 


CAAAGGAAAC ATTAGTAACA 


2220 


TTGATTGCX3C 


ATCACXSAGGA 


TGCTAATGCT 


CAAGCAACTG 


TATTATCTGC ATCGATTCAA 


2280 




C7UVCCATATG 


GATACGGAAG 


AATCGTTCGA 


AATGCGTCAG 


GTCGTTTAGA ACGCATAGTT 


2340 


20 


GAAGAGAAAG 


ATGCAACGCA 


AGCTGAAAAG 


GATATTAATG 


AAATTAGTTC AGGTATTTTT 


2400 




GCGTTTAATA 


ATAAAACGTT 


GTTTGAAAAA 


TTAACACAAG 


TGAAAAATGA TAATGCGCAA 


2460 




GGTGAATATT 


ACCTCCCTGa 


TGTATTGTCG 


TTAATTTTAA ATGATGGCGG CATCGTAGAA 


2520 


2S 


GTCTATCGTA 


CCAATGATGT 


TGAAGAAATC 


ATGGGTGTAA 


ATGATCGTGT AATGCTTAGT 


2580 




CAGGCTGAGA 


AGGCGATGCA 


ACGTCGTACG 


AATCATTATC 


ACATGCTAAA TGGTGTGACA 


2640 




ATCATCGATC 


CTGACAGCAC 


TTATATTGGT 


CCAGACGTTA 


CAATTGGTAG TGATACAGTC 


2700 


30 


ATTGAACCAG 


GCGTACGAAT 


TAATGGTCGT 


ACAGAAATTG 


GCGAAGATGT TGTTATTGGT 


2760 




CAGTACTCTG 


AAATTAACAA 


TAGTACGATT 


GAAAATGGTG 


CATGTATTCA ACAGTCTGTT 


2820 




GTTAATGATG 


CTAGCX3TAGG 


AGCXSAATACT 


AAGGTCGGAC 


CGTTTGCGCA ATTGAGACCA 


2880 


3S 


GGCXKXsCAAT 


TAGGTGCAGA 


TGTTAAGGTT 


GGAAATTTTG 


TAGAAATTAA AAAAGCAGAT 


2940 




CTTAAAGATG 


GTGCCAAGGT 


TTCACATTTA 


AGTTATATTG 


GCGATGCTGT AATTGGCGAA 


3000 


40 


CGTACTAATA 


TTGGTTGCGG 


AACGATTACA 


GTTAACTATG 


ATGGTGAAAA TAAATTTAAA 


3060 


ACTATCGTCG 


GCAAAGATTC 


ATTTGTAGGT 


TGCAATGTTA 


ATTTAGTAGC ACCTGTAACA 


3120 




ATTGGTGATG 


ATGTATTGGT 


GGCAGCTGGT 


TCCACAATCA 


CAGATGACGT ACCAAATGAC 


3180 


45 


AGTTTAGCTG 


TGGCAAGAGC 


AAGACAAACA 


ACAAAAGAAG 


GATATAGGAA ATAATCATTT 


3240 




ACGTATTTAA 


AATGGCTAGG 


ATAAAAGGAT 


AATCCTATGT 


AATATTAATG TAATCTTTAT 


3300 




GATTTAATGA 


TTCGCATAGT AATGGAGTTA CATyTTATAT 


ATAATAGTAA TTGCX3TAAGT 


3360 


SO 


AAATAATTGG 


AGGACTATAA ATGTTAAATA 


ATGAATATAA 


GAATTCGTCA TTAAAGATTT 


3420 




TTTCATTGAA 


AGGAAACGAA GCATTAGCGC 


AAGAAGTTGC 


• TGACCAAGTA GGAATTGAAC 


3480 
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30 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 
ACCTCCGAAT AATTGATTCC ATTAACTTTT TTTGTGaAAA AtTTAAACAG 
CAATAAGTGA AGATGTCCAA ATTATAAAAA ATACATTCCA AAAAGAAAAG 
TAATTACtAC ycTGGCGCAA GTGGTGGTGT TACGTATAAA CCAATCATGA 
GGCGACTGAA GTTGTTAATG AGGTCATTAC TCTATTAGAA GAGAAAGAAC 
TGGCGGATAT TTATTTTTAT CAGATTTGGT AGGTAATCCA TCGCTACTAA 
TAAGTTAATT GCCAGTATTT ACATGGAAGA AAAATTAGAT GCTGTTGTTA 
AAAAGGTATT TCATTGGCAA ATGCGGTTCC TAATATTTTA AAlTrACCAG 
TAGAAAAGAC AACAAGGTGA CTGAAGGTTC TACAGrUTCA ATTAATTACG 
TTCAAGAAAA ATAGAAACAA TGGTACTTTC GAAGAGAACT TTAGCAGAAA 
TTTAGTTGTC GATGATTTTA TGAGGGCTGG TGGCTCTATT AA-RSGTGTTA 
GAATGAGTTT AAAGCCCATG TAAAAGGGGT ATCAGTACTT GTAGAATCAA 
ACAAAGATTG ATTGAAGATT ATACTTCCTT AGTGAAATTA TCTGATGTAG 
TCaAGAGTTT AACGTAGAAC CTGGCAACAG TTTATCTAAG TTTTCATAAA 
GTATTATGAA AATCATTAAC ACAACAAGAT TACCGGAAGC ACTTGGACCA 
CAACAGTTGT GAATGGTATG GTTTATACTT CTGGTCAGAT TCCATTGAAT 
ATATCGTAAG CGCTGATGTT CAAGCACAGA CAAAACAAGT TTTAGAAAAT 
TTTTGGAAGA AGCAGGATCT GATTTGAATT CTGTTGCGAA AGCGACCATT 
ATATGAATGA TTTCCAAAAA ATAAATGAAG TGTATGGTCA ATATTTTAAT 
CAGCGCGTAG TTGTGTAGAG GTTGCGCGTT TXSCCAAAAGA TCTGAAAGTA 
TAG-^GTAA AATTAAGGAA TTATAATTTT CGATTAATAT GTTTAATCAA 
AAAACAGAGA GATATATACT ATAGGGGGGC TCACTACATG AAAOTGACAG 
TAGAAAAATA CAAACAGATG GACGAATGAA AGCACTCGTT TCCATTACAT 
TTTCGTAATT CATGATTTAC GTGTAATTGA AGGAAACTCT GGCTTGTTCG 
AAGTAAACGT ACACCAGATG GTGAATTCCG CGACATCGcg CATCCTATTA 
GAGACAAGAA ATTCAAGATG CAGTGATGAA AGTATATGAT GAAACAGATG 
AGATAAAAAC GCTACATCAG AAGATTCAGA AGAAGCTTAA TCAATTTTAT 
GTAATACATT TGCAATAAGT TGATTTGATA CTGTCGATAA AGCATAAAGC 
GTTTTTTTAG TTTGTATTAA TGTTTTTTTA TTTTTAATGA AAGGCTAATA 



GCGAAGTCTT 
TTAGGTACAG 
GTAAAGAAGA 
GTTTGTTAcC 
ACAAAGTTGG 
CCATTGCGAC 
TAGTAGTGAT 
TTTCAGGATC 
ATTCAAATGT 

AAGAAGTTAA 
ATGAATATAA 
AGGAGTTTTA 
TATTCGCATG 
ATTGATGGAC 
TTAAAGGTTG 
TTCATTAAAG 
GAACACAAGC 
GAAATTGAAT 
GCTTCTAAAT 
ATGTAAGACT 
TAGATGAAGC 
TTGCAATGCC 
ATTCAGATAT 
AAGTAGTACC 
ATTTAGCGAT 
TTTGTCGGCA 
AATATATACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA 


ACAAAACGAT 


TACGTTTTAG 


27G0 




cATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT 


ATTTTTAATA 


ACATTGATTA 


2820 


5 


TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA 


TATCAAAAAG 


CCATTAAGAC 


2880 




GTTTAACATG GTTAATGGCA ATCATCATTT ATTGTGGTGT 


TTATACTGGT 


GCACTAGTGa 


2940 


10 


GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT 


GCCATTACCA 


CGATCTTGTA 


3000 


CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT 
/ *D \ TMcriDMZiTTnM FOR ^KO ID NO- 298: 


ATCAnGGTCn 


nTTAA 


3055 


IS 


( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 








20 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 298: 








TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT 


TTAAAAAATG 


ATCCGTATAA 


60 


25 


AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA 


GAGCGATATT 


CAAGAAGAAT 


120 




TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA 


AATAAAGAAG 


TATTAATACT 


180 




ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC 


TGAATAACTT 


TAATGATAAG 


240 


30 


TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC 


GACGTTAATG 


CGTTGTTATC 


300 




TCTACACTTT TTAATTTTAT AATAGCGCAA GACTAAACAG 


ATTGAAATTA 


GTAACAATAA 


360 


35 


AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT 


TTTTCGGAAT 


ATTAAATTTT 


420 


AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT 


AAAAATATTT 


AAGCCTGAAA 


480 




ATGOTAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA 


TACAATAGTT 


TTAAGTTGAA 


540 


40 


ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA 


CTTAAACATT 


TAACAGATGT 


600 




AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA 


ATGCGTaACT 


ATATAGAGCC 


660 




TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT 


TTTGGAAAGA AAAATGCTGA 


720 


45 


GAATGGTCAA TACTCAATTA TGGAnTTC 
(2) INFORMATION FOR SEQ ID NO: 299: 






748 



(i> SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 4718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



55 
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TAACATTTTT 
AAATTGCACT 
TTTTCATCCT 
CGAAAAAGAT 
GATCTGAATT 
TrGCTAACAC 
TAATCAACCA 
TTGGAATATT 
TTGGtTGCCA 
CAATTGCAAC 
TCCATGTTGT 
TAAGACCAAG 
TCATACCAAA 
TATTTTGTTT 
ATGCGCCACC 
ATAAGAAGGA 
AGTTACCTTG 
TAATATTTTG 
TAAGATTATG 
AAATTAACGA 
TAATCTATAT 
TAAGCTTCAA 
ATTAAGTTCT 
GTTGGATCCC 
GGCTATTAAT 
GCAAAAAGAA 
TTGGTGGAGC 
TATGTCATGG 
ATAGAGCCGT 



GTGTTTATTT 
AATGACAATA 
AACTTAATTA 
CACTAAATAA 
TTTCTTAAAT 
TaCGAATACG 
GATAAACATA 
TGCAAGTGCA 
ACmAAATACA 
CCATCCAATT 
ATGTCTTTTA 
TACGCCTGAT 
ACTTAATAGT 
ACTAGGCATA 
CATAATTAAA 
ATGATTTGTC 
AACAAGTCCC 
TGACmAAGTA 
ATATCTTAGA 
CAAGAGAAAA 
GTTAGAAAAT 
ATTTGAGTAA 
TTTTATGTCG 
ATCTTTTCAT 
TCTAATTCTC 
TTTAAAATGG 
CTTAGTTACC 
TGCGTTGATT 
TTCAGCTTTG 



ATTTCCCTTT 
CATGTCGTAC 
ATCTAGATCA 
TTTAGTGAAT 
GTTGTTAAAC 
ACACCTAAAT 
CTGACACGTG 
TATTCATCTT 
ACTAAAAACA 
AGTGGTGGTA 
GACCATATTG 
GGTATATTCA 
AATAAATTTT 
ATACGATCAA 
GTAGATCCAA 
ATTACAACTG 
ATTTTAATTA 
TGCTCTTTGC 
TTGCATAAAA 
TGTCAATTTT 
TTTAATAAGT 
ACGTGGATTA 
TCCACATACA 
ATCCTACAAT 
AGTGATGCGT 
TTAGGTGTCG 
AAAACCGGAT 
CCAGAATTCT 
TCTTTATTAA 



TCCAAATAAG 
TAATCGTTGG 
AAGTAAGTAA 
ATATAAACAT 
CTAATGCAAT 
TTATTAGTAA 
TACGTTTAAA 
TACGTTTAAT 
GCGCAATCGC 
CTGCTCCAGG 
AGTAATAAGA 
ATAAAAACAA 
GATCTGTAAT 
TATCTTGGTC 
ATAGCATTAA 
CTAGCCATGC 
TCTGTTGCAA 
TCATAATCCC 
AGACTAGGTT 
GTGACACAAA 
AGAATCATGC 
CATAATTATC 
ATACTTGTAA 
CAGGTCTATT 
TTTTTTATGA 
TAGCAACGTT 
CAGCTGATGG 
TTCCTATTGA 
TGGTCTTATG 



TCTCCAACCA 
TAAAATTGGA 
TGAAACAATC 
TTGTGTTGCC 
CCATCCTAAA 
TAAAGGTACT 
GCCCTTAACT 
AGCTAAGGcA 
TGTTAAACTA 
AAAACTCCCA 
CACATAACCT 
ACAAATTTCT 
TCTATTATTT 
GTAATAATTA 
TAAAATTTGA 
GCcCGCAAAA 
TTCTTTGAAG 
CCTCCTTAAA 
TAATAAAATT 
TAACATTTAA 
ATCTAAAAGA 
CCAATAAAAA 
AATTAAATCA 
TATAGTATCA 
TGGGGTGTAT 
AATGATGACA 
TTGTGGTTCT 
TACGATTATT 
GTTAGTTATC 



ATGGCAATTA 
ACGCCCATAT 
ACAGCTAACA 
CATTTTGTTT 
TTTAATAAGG 
GGCAATAAAA 
GATGGTAACA 
TAAAAATGAA 
ATTTGTCCTT 
ATCACTGTGT 
ACAATCCCCA 
CCAACTAACA 
ACAGTTGGTC 
TTTAACGCAC 
GGTATTGATG 
GCCGGAATTA 
TTAACTCTGC 
TTTGTTCATA 
AAATTGTGAC 
TTTATTGCTA 
GATTAATATT 
AATCATTACG 
TATTTCCTGC 
TCTCAAATCC 
AAATTGTTTG 
TTTGTACAAC 
TCTTGGCCAC 
GAGTTAAGTC 
ACTGCATGGA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 

1 c ri 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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TAAGTGGTAT TACAGATTTA TCTATTAACT CAATCGATGT TTTAACAGGC CTAGACACAG 1140 

TGAAAATCTG TACAGCTTAT GAATTAGACG GTAAAGAAAT TACTGAGTAC CCAGCAAACT 1200 

TAGATCAATT AAAACGTTGT AAACCAATCT TTGAAGAGTT ACCAGGTTGG ACAGAAGACG 1260 

TAACAAATGT GCGTACTTTA GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 1320 

TTTCAGAATT ATGTAATGTA CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 1380 

CAAACCTATT AAAAGAATTG TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 1440 

ACATGAGCCT TCTATCTTTA TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

CGATTATTTA GATAAAAAAT ACTAACGTAA AGGCGATATT TGCTAGTCAT AATTTAGAAG 1560 

rTTAgATGAt AtTtAACGAA AAtTAAgATG anATACtTGA ATGGTAA 1607 
(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

TTAGAAGTAA GCACTTTAAT ATCTTTACCT ACCCATGTAC CAACACCTTC TTTAGGCTTT 60 

GGATTTTCAG CATCGTTATT TGATTTATTC ACCTGTTCAG ATCTATTTAC TTTATTACCT 120 

TTATTCGGGT TCTCTAATAC ATCAAATTTT AATCTCGGTG AATAAAAAAG ATATATTAAA 180 

AATCCAACTA AAAATAATAC ACCGACGACT CTTATAATTA ACTTTTTCAT CAATCAACCA 240 

CCTAAAAAGT ATTAATACTA TTGTAAAAAA CAACACATTA ATTAGCAAAT TTTCAACACT 300 

GACATAACTG TGTCGTTTCG ATAAAACAAA ATCATCTTCA GGCATAATTT TAACGTCATA 360 

AAGATTTTCC CTACACTCTA TATCATAACC TATCTTTATG TTTTCAGGTT GAATTTCATT 420 

ATTAAGATTA AAATATGTAT AAAAAAATGG ACAGTTAAGG TATCAAATGA ATACCATCAA 480 

CTGTCCGACT ATTCTTCATC AAAAAACCTG ATAAAACAAA TTGCCTTATC AGATTAGTAT 540 

CATTTGTATA AGCATATTAA TGACCAAATG TTGCTTTAAT CAGTGATGTC GTTTCTCCAC 600 

CTGGATATAA TACATATAAT AATAAGTAAA CTGCTACACC TGTAATTGCA GTACAGAACC 660 

ATATAACTGA AGCGAATGGC CCGAATTTGC GGTGTACATT ATATTTATCT TTAAATGCAG 720 

TAATAATT-TG AACTAGGCCT AGAATACCAC CAATTGTTGC TAAATTAATG TGGAAAAATA 780 

AGAAAATCGT ATAATATTTC TTAATTGATG CTGGTCCGCC AAATGCTGTA TTACCGATAA 840 
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TTGATCGTTT TAAATTTTAT AACAGTAAAG CTCACCCTGA TCTTACCC3TT AAAGTX3AGAG 
AAAAGGATAA CATCGTTAAG GGGATAATAT TAGTAAGAGA TCAAAAGATA CATACTAATT 
TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 
AAAGrACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 
TAAAATTAAA COTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 2760 
AGCTTTAGGT CTTAAAGTTw TAAAAAACGA ATGAaTAATT TTATTGGGAT GAGTGAC 
(2) INFORMATION FOR SEQ ID NO: 296: 



IS 



20 



25 



30 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOUWY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 
TCTGTTAAAA TGATTTTTCT TTTAnAAAGG CCGnAAATCA ATGTTCGATT nTTATTTGCA 
TTATGGTCTC GATATTGGTA GAATATCAAA TGGTTAAATX3 AGAAAAACTT GGAGGTCCTC 
ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGGAAAA GGAAAAATAA 
CGGATTTCTT GGcAGAACAG TCAGATGTTA TCGCGCGTTT TTCAGGTGGT AATAATGCAG 
GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAATTACA TTTAGTACCA TCTGGTATCT 
TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTCGT TGTTCATCCA G1TGCACTAT 
^ TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAAITTA CGTATATCTA 

ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 
gTGGtGACAA TAAGATTGGT ACAhCTPAPA AAGGTATCGG TCCAGCATAT GTAGACAAAG 
TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTOAAAA AGAAACATTC GAAAGATTAT 
TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAGG TATGTTTAAC GAAACATCTC 
CATCATTTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 
TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GTACTTTTCG 
AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCAITC GTTACATCAA 
GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 
AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTIXSG TGATGGTCCA TTCCCTACTG 
AATTATTCGA TGAAGATGGA CATCATATTA GAGAGGTTGG TCGTGAATAC GGTACAACAA 



2520 
2580 
2640 
2700 



2817 
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ATTAGATATT TATCCGAAAT ATATTGAAAC ACGTAAAAGT TCATTGCX3TG ATACGTACTT 
CAATATAAAT CAAAGAGGTG ATAAATAATG AGAATTTTAA ATTTAGTTAA GTATGATTTT 
TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 
GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC CGAAACATAT TATTGTCTAT 
GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 
AAGACAATTA CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG TAAAGTTAAA 
ACACGCGTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 
GCATTAATTC AT-TATGTCAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 
GCGTTTTCTA AATATGTAGA TAATTTATGG TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 
TTGTTTTTAT TCriTAATCAC ACTTGCATCA CAAAAAACAG CAATGATATT TTCATTAGGT 
GTATTTTTAG TACTCATTGT ACCGTTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 
GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGATAAAATG 
ATTAGCTCTA ACrTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 
ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 
TGAGGTIGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 
TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 
TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 
TTGCTCGGTA ATTTATCAGC ACGAACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 
GGTCCCCAAA CTOTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 
CGTGATTTAC CATTTGTTAA ATATTGATCX5 ATCAGTTTTG TATCATCATC TCTATGGAAC 

ACGcixACrr CTAAArrrAA tgcttcactg atgtgtttta gaattgggag attcatcatt 

GCATCTCCAC ACCAGTCTTC AGTAATTACT AATACTTTAG AATAATTCAT CTCTTTTATT 

tttttgatgc gtgaatcatc ttctggtaac tcaaatgatt gatagatact gagaacggta 

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 
GTTTCTAAAT TTCTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 
AGTAATAAAA GGTAATGAAA TAAATTAATT GCAAATTCTT TGTTAATTTT TGTTAAGGAT 
GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 
GTATTCAAAA TCGCTGGGTT GTGTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 
TTGGTTTATA TCAATATCGT CATACGAAAA CTGTGGATTT GTCTAATCTT GAAATAAACG 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



5 


TACCACCAAA TAATATATTA 


GCTGGCATTT TAATAACATT 




ATGATATCAT 


60 




CAATAAAATG TTGAAACTTC 


GTAATTTCAC CTTCATAATC 




Cj L TAATTG CG 


120 


10 


CACTCGATGC TTGCTGATCT 


AAATTTAAAA TATTCGAPAT 




TAATAAACTA 


180 


AATGTTCTAT TAAGCCATCG 


TCAC 'I'CiTlT CCTTTGGTG r 




GCGATACGTT 


240 




TCAAAGGATA GTGTTGCGCC 


AATTTTAATG TCATTAATrr* 


AC CTAAAG AC 


ACACCCGTTG 


300 


15 


CACTGATAGA TTCATAArrr 


J- i 1 UACTA AAAATTGGTA 


AGCTTTCTCA 


ACTTCTTCCC 


360 




ACCAATCATC TACATTATAT 


GTCATGAAAT CTTTCAACAA 


TAAACCATGA 


CCTGGATAAT 


420 




TCGGTGCATA ACAACTAAAT 


CCTTGGTCAT TTAACTCAGC 


TGCAAGATGC 


TTCACATCCC 


480 


20 


GATTTGTACC TGTAAATGAA 


TGTAATAATA ATATCGCATG 


TCCATTTGTG 


CCTTTTAAAT 


540 




ACGtGGACTC GGTGTTTTAA 


TTCTCATTTT TCtaTATAcC 


TCCACTAtGT 


CTAAAGakGT 


600 




TkGCTAAACG CGTTGtCGTC 


GATGATTAA 






629 


25 


(2) INFORMATION FOR SEQ ID NO; 295: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



35 



40 



45 



50 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


295: 






TATGAAAGTA ATGAATGGTA 


ATATTATTAA ATTTGATGGA 


AAAGTAGATA 


TTGATAATGC 


60 


AGA'^ATATC 


GGTTTTTTAA 


TTGAGCATCC 


TAAATTATAT 


GATAATAAAT 


CAGGATTGTA 


120 


TAACTTGAAA 


TTATTTGCAC 


AAGTATTAGG 


TAAGGGTTTT 


GATAAAGCAT 


ACACAGACAA 


180 


AATTATAGAT 


GCATTTGGTA 


TGAGACCTTA 


TATTAAAAAG 


AAAGTTAAGA 


AATATTCAAT 


24 0 


GGGGATGAAG 


CAAAAGTTAG 


CAATTGCAGT 


ATCTTTArTG 


AATAAACCTA 


AATTTTTAAT 


300 


CTTGGATGAG 


CCTACAAATG 


GkATGGATCC 


AGATGGCTCa 


ATTGATGTGC 


TGACTACAAT 


360 


TAAGTCTTTA 


GJcAAATGaAC 


TTGATATGAG 


AATTCtAATA 


TCAAGTCATA 


AGTTAGAAGA 


420 


TATTGAATTA 


ATTTGTGATA 


GAGCTGTATT 


TTTAAGAGAC 


GGnCATTTTG 


TTCAAGATGT 


480 


AAACATGGAG 


GAAGGTGTTG 


CATCTGACAC 


AACGATAGTT 


ACTGTTGATC 


ATAAAGACTT 


54 0 


TGATAGAACT 


GAAAAATATC 


TTGCAGAGCA 


TTTCCAATTA 


CAAAATGTCG 


ACAAAGCAGA 


SOO 
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(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 



10 




1*rTTATGnAA 


CATGCTGnGA 


TGCTACCATG 


AACCTTCTTC 


ATTTCTTTAT 


60 






Ga AATT AATC 


AAATAATAAT 


ACGGTGGATA 


CTTTCCTAAT 


TTACGATATT 


120 




l,V.^ Ill 1 V7 




GTTAAATAAT 


CATTrrriTG 


AACATCCAAT 


ATTGAATAAT 


180 


15 


VjA 1 V, 1 1 1 


ATACGTTTGA 


ATGATGACTT 


GACCTGCCTT 


TTCATGACGA 


CCAGCTCTAC 


240 




(Jl/iViv. 11^ 


CGTTAATAGT 


TGATAAGTAC 


GTTCGCTCGC 


CCGAAAATCA 


GGTAAATTTA 


300 




ACAl Ivjl Ai^ 


TGCATTCAGC 


ACACCAACTA 


AAGTAATATT 


TGGATAATCT 


AATCCTTTCG 


360 


20 


CAATCA JL \- 1 Vj 


AGTACCTAGT 


AAAATGTCAC 


CGTTACCTTT 


TTCGAATTCA 


GTCAATAACT 


420 




A 1 Iv-AlvjUjt- 


ACCTTTCTTT 


GAGGTTGTAT 


CTACATCCAT 


CCTAATTATG 


CGCGCATCTT 


480 


25 


CAAATTCTTG 


TTGCAATAGT 


TCTTCAACTT 


TCTGAGTACC 


AGTACCTACT 


TGTCGAATGT 


540 


GTTCACTCTC 


ACAATTTGGA 


CATTGATTCG 


GTGGCGTCTC 


TTGGTAACCA 


CAATAGTGAC 


600 




ATTTTAATAA 


GTCTGTCGTT 


TTATGATACG 


TTAATGAAAT 


ATCACAGTTT 


GGACATTGCG 


660 


30 


GTACATATCC 


ACAATCCCGA CATAACATAA ACGATGCATA 


ACCACGTCGA 


TTTAAAAATA 


720 




AAACAACTTG 


TTCCTGTCGA 


TCTAATCTTA 


ATTGTATGGC 


TTCACGTAAA 


TCTTTTGAAA 


780 




ACATTGACCG 


ATTACCTTCA 


CTCAATTCTT 


CACGCATGTC 


TACTATATCA 


ATTTCAGGTA 


840 




AAGCTTGTTG 


GTTCACTCTG 


TTTGGTAATG 


ATAGCAAATG 


ATAAACGCCT 


TTTTCAGCTC 


900 




GTGCATAACT 


TTCAAGACAT 


GGTGTTGCAC 


TTCCTAAAAT 


GACTGGACAG 


TGATGATATT 


960 




CAcWCGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG 


ATAATCTTCT 


TGTTTATATG 


1020 


40 


TAGATTCATG 


TTCTTCATCA ATGATGATTA ACCCTAAATT 


TTTGAAAGGT 


GCGAACACAC 


1080 




TTGACCTTGC 


ACCAACACTT 


ACTCTCGCAC 


GACCATCCCT 


AArrnTTGC 


CATTCATCAT 


1140 


45 


AACGTTCCCC 


ATTAGATAAG 


CCAGAATGTA ATACAGCAAC GTCATCACCA 


AATCGACGTT 


1200 


TGAAGCGTAA 
TTCCTTGG 


AACCATTTGC 


GGTGTTAGAG 


CGATTTCAGG 


AACTAACATC 


ATCGCCTGTT 


1260 
1268 


BO 


(2) INFORMATION FOR SEQ ID NO: 294: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

55 
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10 



IS 



20 



25 



30 



35 



40 



4S 



SO 



GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCrrPTCAA AATTATAAGC 
GAAATCAATA TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 
AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 
AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTCATGTC AtTATATTTT 
CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTrTCAGCTA 
TAGATAATC 

(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 
ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC 
TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTCAAAT 
TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT 
GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG 
CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC 
CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT 
TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT 
TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT 
CCTC^TTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA 
GACCGAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG 
CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT 
CAATTTCGAT AACTTTTTTA ATACCTTCGT AATGTTTAAT ATGACTTAAA 
CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATCTCCGCC 
TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT 
ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGCG 
AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 
(2) INFORMATION FOR SEQ ID NO: 293: 



GCCCTAAAAT 
ATCTTGTTTT 
TTAGATAAAT 
TCTTGATTGA 
ACATTAACAT 
TGATGATAAA 
GCCAATGGAA 
GTAACAGAAT 
TTAATTGCCT 
TATGCATAAA 
TTCACTTCAT 
GAACCATTAT 
TATCACTACC 
TTACAGTTTG 
TTAAGTTTTC 



1800 
1860 
1920 
1960 
2040 
2049 



60 
120 
180 
240 
300 
360 
420., 
4B0 
540 
600 
660 
720 
780 
840 
900 
942 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 
nGTnCGGnCA GATATATTGG TC3GTCTTTAG TAAGTGTATC AAATTCATCA GATGTCAAGG 
^ GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 

GAACACAaGA TCAGAAGGAT TGCGTATAGG CGTCCCACAA GTTTCTAGCA AAGCTGATGC 
TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 
TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 
ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 
GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTTGT 
GAAAAAGTTA TACGCAACAC CTACACAAAT CCATCAATTC TTTGGAGTAT GTAGAAGTAC 
AGTATACAAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 
20 TGATTATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATCAGC GACACATATA AAAGCTACCT 
ATTAGCAGTG TTOTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 
TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTTT ATAAGGAATA 
CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 
ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 
^ ACMTTTGCA ACATATGACA ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 

TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 
AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 
AAAAGAAATT TTAGAAGAAC AATTTGAATT GTTAGCAAGA TTAAATGAAC AAAGTGATGA 
CCCT^GAGAA ATAGAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTTTAAGG 
AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ATTTTTGAAG AATTAAAACT ATTAAATAAG 
AACTTACGTG TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 
GTTAAAGAAG CACCAATCCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAGAAGTT 
''^ AAGGAAACrr CTGCTGATTT GACTAAAGAT TATGTrTTAT CAGTAGGAAA AGAGTTCCTT 

AAAAAAGCAG ACACITCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTTGGTGCG 
GATAAGCTAT CTACTATCAA AGAAGAGCAT TATGAAAAAA TTGTTGATTT TATGAATGCG 

-^^y^r^nnr^nr^ •mr'aaanrTT ACiTGCAAGTC 1620 



35 



40 



AC3AATAAATG CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 
GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 



1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 



1680 



55 



iDOaO: <EP 07e65l9A2_l_> 



1117 



EP0 786 519 A2 



10 



IS 



20 
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30 



35 



40 



45 



SO 



TTGTTTGATC TGTGTCGCAT AAGCTTCATC AAATAGTACA 
AGGCACTACA mTTCTCTT TAATACCAAA TGCTTCAGCG 
AGATGATACA TAAGCTTTTG TATAGCTACG ATGATTTAAT 
CTTTTTACCA GTACGACTAA AGCCAACTGT TTTAAAGGCA 
AATAACTTCT TGAGAACGTC TAAAACGCAC TGTATAAATC 
AAAGATGTAG TCTGCCTTCC CAAGTAAATA TGGCAATCTA 
TCTATCTGTA ATATTCGCTT TAAAAACAGT GTGAATATCA 
ACGTAACATT TCGTTATAGA TGTATTCAAA GTTTCCAGAC 
TGTGAACAAC ACCGTATTCC CTTTTTTCAA GTGGAAAAAT 
TTTAAAAATA AATTGTCTTG TATTAAATGA TTGTTTGCGG 
ATATTTACGA ACGATATAAA TACTTTTAAC TTCCGGAGTC 
AAATTCATTA ACATCGCTAG AAATTTCAGG TGTAACAGTA 
GCCGCCTTTT CTAAATTCTT TTAGGTAAGT CTGCAATAAG 
TGTTTCTAAT TCGTTGTATT CTTCTTCTTG TTCTGGCTTT 
AATCACATCT GGGTTTAACT GTCCAATATA ATCAAGTTCT 
CTTATCTTCA GGTAAGTAAT AACCATTATC TAAGATAGCT 
TTGATTCCCA TCTATTTTGA CATCATTCGC CTTCATTGTA 
TAATACAAAA TTACTATCTT CTAAATCTAG GTTTTCACTA 
AACACGTTCC CAATAGATTT TATCTATATA TATCTTACTT 
CTTTTCTATT TACATAATCC ATTTTAATAC TGTTTTACCC 
TTCAAAAGCT TCTGTAAGAT CATTAATTGT TGcAATTTCA 
CGCTAATTTA nCTACAATAT CTGGGTATTG AATGTATAAG 
TTTTGAACCA CTTCGACTAC TACCAATCAA CGTTAGTCCT 
ATTAACTTCT ACTGGGAACT CACTTACACC TAACAGTnCA 
GTAATCGATC ATT 

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



TCAGTACGTG 
TAGAATGGAA 
GAATCAATAA 
CCAACGGCAT 
AATGGGTGAA 
AACTTGTCGA 
TACTTTTTAT 
ATCGTTGGTC 
TTCGTCGTAT 
AAATACTTAC 
GTTACAACAA 
TAAACCGTTT 
AAATTGATTT 
AGATTTTGAT 
TGCTCATTCA 
ACATTGAAAC 
CGTGTCTCAG 
TGTCCTTCAA 
TTAACCAACG 
CAAGATGTAG 
AATTCTTGAC 
TCTACAACAT 
TTTTCCAATA 
ATGCTTCCTT 



GAACACCTGT 
TATCGGTTTC 
ATGGTCCACC 
GCCATACTTG 
AGTCATCAAC 
TGATGCCACG 
CTAAATTTTG 
TAGAGTCTGA 
TAAATATCGC 
GTAATTCTTT 
CATCAAGGAC 
TCTTCGAAAT 
TACCATTTTG 
ATGCATCATT 
CTAATAAGTA 
GACAAACGAA 
TTAAATTTCT 
CGAATAACTG 
TTAATTCATC 
ACAGGTCTGC 
CTTTTAACAA 
CTTGGAAATC 
CTAGACGTGT 
CTGGTGAAAT 



540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1 1 4 C 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1933 
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40 ' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GATGAcrrrr ccccctcata atcttcatgg tccaggcgtc cattaatgcg tcaaaggatg 
GCACATrrrA cctggaacaa atgattcata tggttcataa aaatcacgcg tcgtaatata 

4S ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 

taaagatcaa tagtcactaa ttaataaatc tgttatgaac agtatatcat taacttctct 

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GCAATATTAA GTCTATTTTT 
CACAAATGGA TGCATTTTAA ATAATACAAC CGCGrrATTT TTTTCGCAAT ATCTTGCTAA 
ACGTTCAAAA TCAATTTTGA AAAATGGGTA ATGTGCTGTA CCATGACCAC TACCTCTAAA 



1140 
1200 
1260 
1320 
1380 



CGCTGGGGCC CAATTTAATT TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 
TTCAGTTAGA GTTGGTAACT CTTTGTTTAA AGTACATTTG TATGTTGTTA ACCTAACAAT 

^ TCCAAAATCA TATTCATGTT CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 
CATTTCTTCr CTAATTTCTC TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 
ACnGCCAGGA AATTCCCACA TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 
AATTTTGTTA TCAGAAAAAA TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 1440 
CACCCTTCAA TTTAAAATCA TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500 
AAATCAATAT TAAAAATTAA TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560 
ATACACATCT ATTATATAAT ACTTGTGAAA AGTATrGTCT TGGGGCTGTG riTTTTTACT 1620 
TTTGGGGCGT ATTTCTTTAT AATTCATTAC ATAAATGTAA GGGCTTTAGT TTTCATGTTT 1680 

20 TATTAAGTCT AACTGAGATT TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 
TAGAACGAGC TAATGATAGT TTAGTrAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 
AGCGAGAAGG TTITGAAAGT AAATTAACAT TTGGTACTGC GGGTATACGC GGAAAATTCG 

25 GTCrrGGTGA AGGTCGACTT AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 
GTTATTTAAA TGCCCAAACA AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 
TTTCcAACTG AATTCGCCC 

30 

(2) INFORMATION FOR SEQ ID NO: 290: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS : double 

{D) TOPOLOGY: linear 



1740 
1800 
1860 
1920 
1980 
1999 



60 
120 
180 
240 
300 
360 
420 



55 



1115 
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5 



CAAACTGCTC TTTTTGCAAr 


1 V- 1 ALX»CCCA 


TAAATATCGT 


TTAATATATC 


TGTTGCTAAA 


2S20 


TAAATAGAAG CAAACATGAr 


/\1 lACCTAAA 


GTTGCTGAAA 


TACCAAAGAT 


TTCTACAGTT 


2580 


TTAATCACTT GTATGTTGGC 




Uv.,AA n.\3C_AA 


CCCATGCAAT 


TAAACCTTGT 


2640 


TTACCAAAAA AGCGATACAT 


AAGTACCATA 


AGCACGAACG 


TTGCAATAAA 


CGTAACTAGT 


2700 


CCTAAAATTT CATTATACAT 


ATTAAAATGT 


CCTCCTAAAT 


TTTGATCATG 


CGGGTGTTTA 


2760 


GAAACCGCTC AATAAATAA 










2779 


(2) INFORMATION FOR SEQ ID NO: 28 9: 









(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 





(xi) SEOtJPMr'P nccno T DnpT/^xT 


SEQ ID NO: 


289: 






2S 


ACTGATGTGC GTTCATrvAAA aar^naTaTna 


TCAAATTCAT 


TTTCATCAAA 


TTGCTTAAAA 


60 


TTATCATCTC TAGATAATi^T XTnAATY^r^o^ 


GCAAATAAAT 


ATTTGGCATC 


GACATCTCTA 


120 




x^jxxxxuuxtj lUAACAATCC AAAATCACTA 




TAGGTAATAC 








TCAi-ri-i-rrA 


TTTTTTAAAT 


180 


30 


TCTTCCTTAG CTCTATTTAA AATCCCCTCA 


TTATGAACAA 


TAAATAAAAA 


TTTATTAGGG 


240 




TTTACTTCTC TPiACATCrTAA TGCACATAAA 


ATCGTTTTAC 


CTGTACCAGT 


TGCAGATATT 


300 




ATTAACGCCT TATCTTTGGC TTTATCCCTA 


ATAGCITITA 


ATGACCTTAA 


TGCTTCTGCT 


360 


35 


TGCATTAAAT TGGGTACAAT TTCCACTGAT 


TTTTTCACCT 


TATCAGCTAG 


CAGCATTTGA 


420 




GTTTGTTCAA CCTCCGCTAA TTTTTCTAAG 


GAACGGTACT 


CAAATGATTC 


TTTATATGAA 


480 




TTAOTCCATT GCTCAGTCAG TGGGGTACTC 


TTTTGCCATA 


ACAAGTCAAA 


TTCACTi-rrr 


540 


40 


ACACTATCAA CTAAATCGCC ATTTTTCATA 


GTAGACAGTA AAACATTATG 


CTCATAATTA 


600 




ACCTTTAAGG CATTAGATGT TAAATTAGAG 


CTTCCTATTA 


CCATAGAACT 


ATAATCCTTA 


660 


45 


TGCTCAAAAA TATATCCTTT GGCATGGAAT 


CCAGCAATAT 


CAGTTAATCT 


TACCTCTACA 


720 


ri'l-i-riAATT TAAGTAATTC TCCATACATT 


TTAGGACTAT 


TAAACCCTAA 


GTAATTAGAT 


780 




GTTAATATTT TCCCTTTAAC ACCCTTATTG 


CTTAAATCTA 


ATAGTTGAGC 


CTTTAAGCTG 


840 


50 


GcTAAACCGC TTTCTGTTAT AAAAGCCACA 


GAAAAATAAA 


ACGTTTCACA 


TTTTTGAAGT 


900 




TCATCTATAA TTGTTGAAAG AACTTTTTCA 


TTTTTATTAT 


TTACTAAAAG 


CTTCGGTGTA 


960 




TAATTCCCTT TATGAGAAAT ATGTTTGTCT 


ATAAACCCTT 


TATGTAAAGA 


TTGATTGAAA 


1020 
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AAATAATGGC 
TCGTAAAGAA 
GAAAGATAAC 
TGTTGAAACA 
AAAGAAAAAT 
TAAAAAGGTA 
TTCTTTGATA 
TTCTTTTTAA 
TCTCTCGGTA 
TCAAAATAAG 
AGCTTTGAGT 
GCATAAATAC 
TATGTATAAT 
CCTTTCGTCG 
TTAGTTAAAG 
CATCACTTAA 
TACGTAtACT 
TTCTATGCTC 
AGCTTTTTTC 
GGCAATGCGA 
AATCTGCaAG 
AATCCAAGTT 
CCCTTAAAAC 
TCTTACCTTT 
AAGCTAATTT 
TTCCGATAAA 
TACCATATGg 
AAGATAAATA 
CGTGGCACAA 



GAGTCACTCG 
GATGAAGAAG 
GAACATTCAC 
ACAACTAGTA 
CAAAAAGATA 
GCAGCTAAAA 
AATAGaGGAG 
TTAATTTATA 
TCCATTTAAC 
GTTTGAAATT 
CTGTATATAA 
ATGCAGCCCA 
GCTGCTCATC 
CAGCATCAAA 
TTACTATTAC 
ATAAGTAACA 
GTaATATACT 
TACGATGTTA 
AATTCTAGTC 
GTTGGGGTGT 
TTTGGGATGG 
GGGGTGGGAC 
TTATTCTTTT 
ACGATAAAAT 
TAATACGTAC 
TGCGATAGCT 
CACGGATGAA 
CATCAATATG 
CATCAAAGAT 



ATAAAGTTAA 
ATGTGGAAGT 
CACTCTTATT 
TTGAATCTAA 
ACCAATCCAA 
AGAAGAAAAA 
CACCX3ATTGA 
CAATGCCTGT 
AAATAATAAA 
TGCGTTTTTC 
TAGTGCGTTT 
TTCTGCAGTG 
TTCTTTGATT 
ATTTATTTTC 
TGTAATACAA 
CGTGTTTAAA 
TCCTAAAAAA 
TATTGGCAGT 
AGGGGCCCCA 
GGGTCCCAAC 
GCCCCAACAA 
GACGAAATAA 
GTGTAGTAAG 
GATTTAGCAA 
GTTGTAATAA 
ACAAATAAAG 
AAACGTTTTa 
TTGACCAATA 
TGCGTGTAAT 



ACATACGCCG 
TACAAATGAA 
CGCAAAACGA 
AGATGAGGAC 
AGACAAAAAG 
GAAAGCTAAG 
CATCACATCA 
TGAGCX3TGTT 
TCAAAATCTT 
ACATAACCAG 
TGAACATTTA 
TGGTTATCCA 
ACAATGGCAC 
GCCATAATAA 
AATATGTTGG 
ATACTCGCTG 
ATCATCTTCA 
AGTTGACTGA 
ACACAGAGAA 
ACAGAAGATG 
AGAGAAATTG 
ATTTTGCGAA 
TGCGTTAATA 
TATATCCAAA 
ATATTTCAAa 
CTGTATCAAT 
TCmGaACTAA 
ATATATGCGA 
GCTTTTTGTG 



TTCTTCTTAC 
AACACAGATG 
CGCAAAGATA 
GTTCCTTTAT 
TCAGCATCAA 
AAAAATAAAA 
GTCGGTGCTC 
GATTCGCTTC 
TTTCAAATAT 
CTTCAATGCT 
ATTCACGTGC 
TTTCGCCTAA 
ATGTACTTAT 
ACCTACTTTC 
GTAATCCATT 
ATTCAAAGAT 
GGCTGGGACA 
ACGAAAATGC 
TTTCGAAAAG 
ACGAAAAGTC 
GATTCCCAAT 
AATATTATTT 
GCCTTGATCT 
TGGTACATTG 
TACAAmTGTa 
TATTGrGCTT 
ATACTTTTTT 
CGATTGAGCC 
CCATATCTTC 



CAAAACGTCG 
AAAAAGTGTT 
AAGAGGAAGA 
TATTGGCTAA 
AAAATACTTC 
AATAATTTGT 
CTTTTATTTA 
TTTGTTTTGT 
TTCTATTTGA 
ATCTGCAATT 
ATGTTCTAGT 
CTCATGTGTA 
GCCTGGATTT 
TATTCAATAC 
AAAAAACACG 
GATTTTCTAA 
TAAATCAATG 
GCTTGTAACA 
AAATTCTACA 
AGCTTACAAT 
TTCTACAGAC 
CTGTCCCACT 
AACTTATCAA 
AAAACTGTTG 
CCAGGTAAAC 
AAAAATGTAC 
AATTAGTGAA 
TAAAGCAATG 
TGGTGCAGGA 



720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1B60 
1920 
1980 
.2040 
2100 
2160 
2220 
2280 
2340 
2400 



55 
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IS 



20 



25 



30 



CATTCCAATT TGCATATTTA AGTTTAATCC aGGAAGAAAA 

TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT 

AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCX3G 

AAGAAGTGTT ACCTGraAtT GcGGTTATCc ATcATTTaAT 

CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAAITCC 

ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT 

GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT 

GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC 

CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA 

ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT 

ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA 

CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA 

GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC 

(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



GATAcAAGCA 
TGAGCGTTTT 
TCATTTAAGT 
TGaAGTTGAt 
TTTCGATGTG 
TATTGGTGAT 
AGGTTTTGAA 
ATTAAAAGAT 
TGTAAAACAG 
ACCACCAACT 
AAAATCGGG'^ 
TTCCTTCACC 
CAAATCGTCC 



AATCATCAAA 
ATGAACAACC 
GCTACTGTAA 
CCAGTTTTAT 
TATCCTCAAG 
TTTCCAGTAG 
GAAGGACATA 
GCTGGATATG 
CTTGAAAAAT 
TTATTTAAJ^A 



AATTTCCAAG 
TA 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1732 



35 



40 



45 



SO 



SS 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT 
AAAAATAAAT AGTATTCGAA ATAGTCAAAT TGGCACAGCT GATGAAAAAC 
GAATCAAATT AACGAAATTG TGCTTGAAAC AATTAGAGAT ATTAATAATC 
ACAGCAAGTT GAGGCTGCAT TGAACAATCG TATTCCTCGA ATTTCAGCAG 
AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTCGAAAT GAATCTAATA 
AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACTATTGGAC 
ACTTGATGAA GATGATGACA TTGATCCACT TCATATCCGT CACTTTAGTA 
TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAG 
GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG 
TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC 



TTTCAAATGA 60 

AAGCAGCAAT 120 

CGCATACATT 180 

TACAAATTGT 240 

GCCATTTAAC 300 

ATAAAAAGAA 360 

ATAATTTCGG 420 

CTAGTTTCTG 480 

AAATAAGAGA 54 0 

CACTTTTATT 600 
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1112 



10 



IS 



20 



30 



35 



SO 



CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 
(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



EP 0 786 519 A2 

ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 420 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 480 

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540 

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600 

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660 

TGTrTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720 

ATtSGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 



812 



25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 60 

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 120 

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 180 

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 240 

GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 300 
AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 
AAATOCAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 
40 CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TACATTAAGT 

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCGCTTA S40 

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600 

AGTGAAGGTA CTGATTrCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660 

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAGCATTAA 720 

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTITATTGG AGAGGGACGA 780 
ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 840 
AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATTGAAAA TCTTGGGCTA GCAATGGAAC 900 



360 
420 
480 
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(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLCXSY: linear 



10 



IS 



20 



25 



30 



35 



40 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 
AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG CAAGATGATG 
ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC 
AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTCAAATC ATACCTTCAT 
ATTAATACCT CGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG 
GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTCTCC 
CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCATA ACTCGGAACC 
CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATCTAA 
CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT 
AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT 
CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA 
CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG 
AATAGTTGAA AATGTACTAC CGACAGAGCT GGcAAGGAAT ATGGCACCAA 
TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA 
TCTTC 

(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



GCACGATAAA SQ 

TCATATGTTG 120 

TCAACACTAA 18O 

ATGCACCATT 24 0 

CAGCAATAAA 300 

ACCAACCTTG 360 

GGATTATCTT 420 

GCTGTTCAAA 4 80 

TTTTACAGTA 540 

TTCTCCAATA 600 • 

AGAACGGGTT 660 

CTGGAAACGA 720 

TGCTACAGCT 780 
785 



45 



SO 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 
CTAACGnGAT AAGGTTGCAA nTTTATCTCA ACATCTGATG ACTGTAATTT 
AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG 
TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTCATA CTAAAATATC 
ATAATTGTTG TTAACGGTGT ACGTATCTCA TGCGAAATTG ATCTTAAAAA 
GTCAGTTGAC GTTCAGCCTG TAACATCGAT TCTCTCGTTT GTTTAAGTAA 



TGTtAATGAT 
TACAAGTAAT 
TAAATkTCCC 
ATTTGAATGT 
CGTCACATTT 



60 
120 
180 
240 
300 
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CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 600 

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 660 

GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 780 

AAATGGTCAA TCATATGTAT CGATTACCX3G TTATGTATAT AGAGTATAGT GGCATTTATG 84 0 

GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 900 

GTGGCGGTAT TTCCTCAGAA CAACAAGCX5A CAGAGATGGC AGCTATTGCA GATACAATTA 960 

TCGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020 

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA GGACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 1140 

AACACATAGA ATTGCTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 1200 

TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 1260 

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 1320 

TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 1380 

AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 1440 

TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 1500 

TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 1560 

ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 1620 

35 CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 1680 

TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG 1740 

TAAGTrTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 1800 

TGCTGATATT CAAAATATCT TATCATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 1860 

TTTAGAGCAA AATTATCGTT CGACGAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 1920 

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 1980 

TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 2040 

AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 2100 

CT^TCACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 2160 

GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202 
(2) INFORMATION FOR SEQ ID NO: 285: 

55 



25 



30 



40 



45 



50 
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600 
660 
720 
780 
840 
900 
960 
1020 
1080 



CCCAGATATT TCGTATCAAG GTATAATAGA CCAATATACA AAAAGTTCTG CTTTCTTTGA 
TTTTAGTGAA GTATACXSATT CAATAGATTC TACACAACTT GCTGCGAAAA AGTCACTTX5T 
TGGAAATCAA TCTTCATTTT ITATCTTGAG TCATGAACAA ACGAAAGGTC GTGGGCGATT 
TAATAGACAT a^GGAGr^CTT CAAAAGGGCA AGGACTTTCG ATGTCTGTCX. TGTTAAGACC 
,^ TAACGTTCCA TTCTCAATGA TATCTAAATT TAArTTAn^ ATTGCATTAG GGATAAGAGA 

TGCGATTCAA CATTTTAGTC AAGATGAAGT CAAAGTGAAA TGGCCGAATG ATATATTTAT 
TGATAATGGT AAAGTGTGTG GTTTCITAAC TGAAATGGTT GCTAATAATC ATGGTATAGA 
15 AGCAATAATA TGTGGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATCAAAG 

TATTAGACAT AGAGCAACAA GTATACAATT ACATCATAAA AATAAAITAG ATAGATATCA 
ArrrTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATO TAACGTTACC 1140 
TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCTAAT ATTTGGAATA GAACGITGCt 1200 
ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 
TCTAATTGTT AGAGATGAAG CGGGTGAATC ACACCGTITA ATTAGTGCAG ATATAGATTT 
TTAACACTAA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATrTG 
GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 
30 (2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2202 base pairs 

(B) TYPE: nucleic acid 
35 ^C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



1260 
1320 
1380 , 
1421 



40 


-(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 


284: 






CCAAGTTGCC 


TAAAATGATT AAGCAAGGTT TATACCCTAT 


GCnAACGATT 


GAACAAGAAT 


60 




CTGGAGCCAT 


CCGACTGCCA ACGATTTCTA GAGTGAnGCG 


TTCATTACAA 


TGGGGTAATG 


120 


45 


ATGCTTATAC 


AATGATTTTA GATCGTATGA ATATTGAAAC 


AAATGAATAA 


TAAATGAACG 


180 




ATAAACAATG 


GTTATCTATC TGCACTAATA AGGTAGATAA 


TCATTGTTTT 


TTCACGAAAA 


240 




AATTTACAGA 


GTAAAAGAAC TTAAATTTCA TATTAAGTCT 


TTAGAACTCG 


ACACTTAAAA 


300 


SO 


ATGCTATAAT 


CATATGTATG TTAAAAAAGG AGTTTCGGAA 


AATGTATGAC 


ATTAAAAAAT 


360 




GGCGCCATAT 


TTTTAAATTA GACCCAGCTA AACATATTTC 


AGATGATGAT 


TTAGATGCGA 


420 


£S 


TTTGTATGTC 


TCAAACAGAT GCAATTATGA TTGGTGGAaC 


TGATGACGTT 


ACTGAAGATA 


480 
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GCCGGATTCT 


CCTGGTGTGC 


CTGAAGGTAT 


TGCTAAAAAT 


ACAATTACAG 


TTCCATACAA 


2100 




TGATTTAGAT 


GCACTTAAAA 


TCGCTTTCGA 


AAAATTTGGa 


AACGATATTG 


CTGGTGTAAT 


2I€0 


5 


CGTAGAACCT 


GTTGCTGGTA 


ATATGGGTGT 


CGTACCGCCG 


ATTGAAGGTT 


TTTTACAGGG 


2220 




ATTAAGAGAT 


ATTACGACTG 


AATACGGCX;C 


ATTGCTAATT 


TTCGATGAAG 


TAATGACTGG 


2280 


10 


TTTCAGAGTC 


GGTTATCATT 


GTGCACAAGG 


TTACTTTGGT 


GTGACACCAG 


ATTTAACTTG 


2340 


CTTAGGAAAA 


GTTATCGGTG 


GAGGACTACC 


TGTAGGTGCA 


TTTGGTGGTA 


AAAAAGAAAT 


2400 




CATGGATCAT 


ATAGCACCAT 


TAGGAAATAT 


TTATCAAGCG 


GGTACGTTAT 


CAGGAAATCC 


2460 


15 


TCTTGCAATG 


ACAAGTGGTT 


ATGAAACGTT 


AAGCCAATTA 


ACGCCAGAGA 


CATATGAGTA 


2520 




TTTTAATATG 


TTAGGCGATA 


TACTTGAAGA 


CGGTTTAAAA 


CGTGTATTTG 


CTAAACACAA 


2580 




TGTACCAATA 


ACTGTAAATA 


GAGCAGGTTC 


AATGATTGGT 


TATTTCTTAA 


ATGAAGGACC 


2640 


20 


TGTAACTAAT 


TTTGAACAAG 


CGAATAAAAG 


TGATTTGAAA 


TTATTTGCAG 


AAATGTATCG 


2700 




AGAAATGGCA 


AAAGAAGGTG 


TGTTTTTACC 


ACCATCTCAA 


TTTGAAGGTA 


CATTCTTATC 


2760 




TACGGCACAC 


ACGAAAGAAG 


ATATTGAAAA 


AACGATTCAA 


GCATTTGATA 


CGGCTTTAAG 


2820 


25 


TCGTATTGTA 


AAATAAATAT 


ACGGACAAAT 


TGAGAGCCTG 


AACTTTGTTC 


AGGCTCnTTT 


2880 




TAAATGTATA 


TAAGGCATGG 


GCGGCGACTT 


GATAGTGAAA 


GTCCACTACT 


A 


2931 



(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 



r(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 



40 


AATTATGAAT 


GCATTACCAG 


TATTATTACA 


AAAGAACAAT TAAAAATGTT 


TGTTTATGAT 


60 




TATGATACGC 


ATCTCATTAA AAATGTAaTG 


GTTGCAGCAG ACGTGTTAAA 


GGCAAATGAT 


120 


45 


ATTCAAGGAC 


ATGAACCATT 


AATCGTTAAC 


CTTCAAACGA TTGATGAAAC 


ATTACATCGT 


180 


TTACCTATGC 




AGACATGATG 


GTTAATGGCG GTGTACTTAT 


GGCACATTTA 


240 




AATGCCAAAA 


GTGGTCCGTG 


GTTAAAAGAT 


GTGCTAAGAC AAATTGAGAT 


AGCGATTGTA 


300 


SO 


ACAGGTA7AG 


TAAGCAACGA 


AGAAACTGAA 


ATTTTGAAAT GGGTGGATAA 


TCATGTCAAA 


360 




ATATAGTCAA 


GATGTACTTC 


AATTACTCTA 


TAAAAATAAA CCGAATTATA 


TATCTGGACA 


420 




AAGCATTGCG 


GAATCACTTA 


ATATTTCACG 


CACTGCAGTA AAAAAAGTGA 


TTGACCAATT 


480 



55 



NSDOaO: <EP 0786519A2_I_> 



1107 



EP 0 786 519 A2 



10 



15 



20 



25 



30 



35 



40 



45 



SO 



AATGPiAGTTG TTAAAATAGA TTTATATACT 
GTTAAAGAAA TGATAGAACA TCAACAAATC 
GTACGTTATT ATTTTAATGA AGGATTTGTA 
GAACAAACAG CACGGACCAT TAAATCATAT 
ACACTCQAAT CACTAATTGA AAAGATTTTA 
GACATAGAAG ATTGAGATCA TCAGCGACAA 
GAAAAGAAGA TTTAATATAT CCAATTTTTG 
TTAAGTCATT GCCAGGTGTA TACCAAATCA 
AAGCTTATGA CTTAGGCATA CGTGCCATTA 
ATATAGGTAC TGGTGCATAC ATTCACXIATG 
AAAAAATGTA TGATGACTTA TTAATTGTTG 
ATGGTCATTG TGGCGTGATT GATGACCATA 
CACTACTTGT TAAAACAGCA ATTTCTCAAG 
GTAATATGAT GGATGGTTTT GTTGCTGAAA 
ACAATATTCC TATAATGAGT TATGGTGTCA 
GAGATGCAGC AGATTCAGCG CCATCATTTG 
CTAACCGTTT GGAAGCACTT CGTGAATTAG 
TGATTGTTAA ACCTGCTCTA AGTTATTTAG 
ATGTTCCAGT TGTTGCATAT AATGTGAGTG 
AAAATGGTTG GATAGATGAA GAACGTGTCG 
CAGOTGCTGA TATGATTATT ACGTATTTTG 
AAGGTTTTAT ATTTATGATT TTCCATAAAC 
GAAATCAGAA GAAGCAATGa AGGTTGCTGA 
AGTACGCGCA TTTAAATCAG TAGATACACC 
AAAAATTTAT GATATCGATG GTAACGAGTA 
TATTTTAGGA CATAGAGACC CTCAAGTTAT 
TACAAGTTTT GGTGCATCAA CATTACTTGA 
AGTACCTTCA ATAGAAAAAG TGCGTATGGT 
TTTAAGATTA GCACGTGGTT ATACTGGCAG 



TCAGTGCCTA ACAAACAAAA TATACAAGAT 
GATGCATTAA CATTTTCAAG TTCGTCGGCA 
CCAAAATTCA AGTCGTATTT TGCTATTGGA 
CAACAACCAG TAACAATTGC AGAAATTCAA 
GAAAGTAGGG GCTAAAAATG AAATTTGATA 
TGAGAGATAT GGTTAGAGAG AATCATGTAA 
TAGTTGAAAA AGACGATGTG AAAAAAGAAA 
GTTTGAATTT ACTTGAAAGT GAATTAAAAG 
TGTTTTTCGG TGTTCCAAAC TCAAAaGATG 
GTGTTATTCA ACAGGCAACA CGTATTGCTA 
CAGACACTTG TTTATGTGAA TATACTGATC 
CACATGACGT TGACAATGAT AAATCATTGC 
TGGAAGCTGG TGCTGATATT ATTGCGCCAA 
TTCGTCGTGG ATTAGATGAA GCCGGCTATT 
AGTATGCATC AAGTTTCTTT GGACCTTTTA 
GGGATAGAAA AACGTATCAG ATGGACCCTG 
AAAGTGATCT TAAAGAAGGG TGCGACATGA 
ATATAGTTCG AGATGTTAAA AATCATACGA 
GAGAATATAG TATGACTAAA GCAGCGGCAC 
TTATGGAACA AATGGTTTCA ATGAAACGTG 
CAAAGGACAT TTGTCGCTAT TTAGATAAAT 
TGTAGGAGGA ATTTACTTTA TGAGATATAC 
AACTTTAATG CCTGGTGGTG TAAATAGTCC 
AGCAATTTTT ATGGATCACG GTAAAGGTTC 
TATCGACTAT GTACTAAGTT GGGGACCACT 
TAGTCATTTA CATGAAGCAA TTGATAAAGG 
AAATAAATTG GCGCAgcTCG TTATTGACCG 
GTCATCTGGT ACAGAAGCTA CATTGGATAC 
AAATAAAATT GTGAAATTTG AAGGTTGCTA 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



55 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 





AGTCAGGTAT ATCATGCCaT yCTGAATTGG TCGATATTAA 


TATCAGTGGT 


GTTAAAGAAC 


€0 


5 


GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA 


TTAAATGCaG 


AATAGAGAGG 


120 




AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT 


AGATATGGAT 


GACACATTGC 


180 




TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA 


TTTAACAGCA 


ATTCAAGATG 


240 


10 


AAGGTTATTA TGTTGTTCTA GCATCTGGTA GACCTACTGA 


AGGTATGATT 


CCAACTGCTA 


300 




GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG 


TTATAACGGT 


AGTAAAACGA 


360 


IS 


TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT 


TGGTAAGCAA 


GATTTCGATG 


420 


AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT 


TACATATCAT 


GATGGTCAAA 


480 




TTATTTaCGA CAGCGAACAT GAGTATATGA ATATTGAAGC 


AGAATTAACA 


GGTTTACCGA 


540 


20 


TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCGA 


TGTACCCAAG 


GTCATGGGTG 


600 




TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT 


GAATGGTGTG 


TTCAATGATA 


660 




ATGTAGATGC TACGACAAGT AAGCCATTCT TCTTAGAATT 


TATGGCCAAA 


GACGTTTCAA 


720 


25 


AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA 


TTCGGTGGAT 


CAAGTCATTG 


780 




CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAAGT 


CGCAGGTCTA 


GCTATTGCTA 


840 




TGGGGaATGC ATCAGATGAA CTTAAGCAAT ATGCAAATGA 


AGTTACGTTG 


GATCATAATG 


900 


30 


AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT 


TTTAAAATAA 


GCCTTAACAC 


960 




ATGATATTTG AATAAGATAT CTTGTGGTTA AGG^-iTTTTA 


TTTTTGTGAA 


AATGACTTCA 


1020 


35 


GTTATACTAT GGAGGATTTG AAATACATAT TTTAGATTAG 


TAATGATATC 


AAACGAATAG 


1080 


AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG 
(2) INFORMATION FOR SEQ ID NO: 282: 


TCACGGAAGG 


nAGATAAAT 


1139 



* (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 
TCTAAAAATG CTCTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 
GCTGTGATAG GAAGTAAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 
AtGCCAAACG ACTTTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 



NSOCXilD: <EP 0786519A2_I_> 



1105 



2S 



30 



EP 0 786 519 A2 

GTTCAATACA GAAAAAATAA ATTTAGATCT TGAAGCATCC TACAATTAAT ACAGATCCAT 
TTCAATATAT TTAAACTAAA ATCTCC3GGAT TTCTAAATTT TGAAATTTCG AGGTTTTnAT 
ATnTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG 
CCAAATCATG GCAAAACATC CATTCGAACA ATTTAATCTA GAATCTAGTT TAATTGACGC 
TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 



€0 
120 
180 
240 



io«««iivj\vj «>*h_(jAA1-1A IICCAAGAAT 300 

ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT ACAGGGAAAT CTCATGCATT 36 o 

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAACCACAAG CAATCGTAGT 420 

,5 TGCACCAACA AGAGAACTTG CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT 480 

TAAAGCTGGT GTTTCAGTTA AAGTTTTTAT TGGTGGTACA GATATAGAGA AAGATAGACA 540 

ACGTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATCACTTAGC 600 
20 TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTfi&Tr, aanrrinivTr-r 

•-- - — w« w J. O O U 

TATGATTGAC TTAGGATTAA TTGAAGATGT AGATTACATT GCTGCAAGAT TGGAAGATAA 720 

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG TTACAACCAT TTTTAAATAA 780 

ATATTTAAGT CATCCAGAAT ATGTAGCTGT CGACAGTAAA AAACAAAATA AAAAGAACAT 84 0 

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900 

TGATATACTA AATCCATACT TATGTATTAT TTTCTGTAAT AGTAGAGATA ATGCAAATGA 960 

TTTAGCACGT TCACTAAATG AAGCTGGTAT TAAAGTTGGT ATGATTCATG GTGGCITAAC 1020 

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 1080 

TGCCaGCGAT TTAGCATCTC GTGGTATTGA TATTGAAGGT GTTAGTCrTG TCATCaATTT 114 0 

TGATGTGCCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 1200 

TTATTrAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GAACACAATA TTTCATTAAT 1260 

40 AGAAGATCGC GGTTTTGTAT TCAATACTGT TGATATTAAA GATGGTGAGT TAAAAGAAGT 1320 

TAAAGCX3CAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT GACCATTTAA CTAATCAAGT 1380 

GAAGAACAAA GTTCGAAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 144 0 

TAAACAAGAA ^^^^ 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 
SO (A) LENGTH: 113 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ACAATTTTTT 


CTTCATTATT 


AATAGATTCT 


ATAGCTTCTA 


CTACAAGTTG 


ACGCCAGTCA 


3000 




TCTTTATAAA 


TATCTTCATT 


TCTAGTAATT 


TCCCCAATTT 


GCTCGTCCAC 


ATCTATGTCC 


3060 


5 


GATATATTGT 


TGAACAAATC 


CATTAAATCX3 


TTCAATGCCT 


CAACAGTAAA 


ACTTTCCCTT 


3120 




TTAACTGTAT 


AAGTTAAAAA 


TGTCCCATTA 


TTATCAGTTG 


AAATTAAAAC 


TTCAGGTAAT 


3180 




ACAAAATGAT 


TTAGTCCAAA 


CTCTCGCCAT 


TCATCATCTG 


ATTTATGACT 


TGAAAATTGG 


3240 


10 


AACCCTCCAA 


CAACTCGAAG 


ATGATGTTTC 


TCAGATTGCG 


GATGTATAAA 


TGTGATGTTA 


3300 




TGTTTTAATT 


TTTCCCAGTC 


TTTAAAAATA 


GATTGTTTAT 


TTTTAGAATT 


ATTTTTGAAT 


3360 


IS 


AATTGAATTG 


CTTTGTAGCC 


AAAATATGAC 


GTTCGATTAT 


CATTCAAACG 


CATATAAAAG 


3420 


CGATCTCCTG 


CCTCATTGTC AGTGAGATGA AATAATGTGC 


TCGGGTCTAG 


TGACTGTGAT 


3480 




AATTTCACTT 


CAACTGAAAC 


CCATTCCTTT 


GAGCTGCCAT 


ATATCTCTTT 


GACAATATCG 


3540 


20 


TCCTCTAATA 


CGCCCGTAGC 


CATCCATTTC 


ACTTCTTTCT 


TCGTCrriTT 


TCACTCATTA 


3600 




TTATATTGTA 


TCA'rrrri'GG 


ATAATTGTGT 


TACAAGAATT 


GCTTAAACTT 


ATCTTGCAAT 


3660 




TTTTCACGTC 


AATTGACCTT 


TATGCTACTT 


TCTATTAAAA 


TATCTTTGTT 


ATAAAAAATA 


3720 


25 


TGATTTAAAG 


AGGTTTTGTA 


TTCAATGAGT 


AATCAATATC 


AGCAATATTC 


TACAGTTAAG 


3780 




AAATATTGGC 


ATTTAATGCG 


TCCTCATACA 


TTAACTGCTT 


CCGTAGTACC 


CGTTTTAGTT 


3840 




GGTACAGCAG 


CATCTAAAAT 


ATATTTTCTT 


GGTAGCGAAG 


ATCATATTAA 


AATCAGCCTA 


3900 


30 


TTCATTGCCA 


TGTTACTAGC 


ATGCTTACTT 


ATTCAAGCAG 


CAACTAATAT 


GTTTAATGAA 


3960 




TACTATGATT 


ATAAAAAAGG 


CCTCGATGAT 


CATGAATCTG 


TAGGCATTGG 


TGGTGCCATT 


4020 


35 


GTTCGCAACG 


GTATGAGCCC 


AGAGCTTGTG 


CTACGATTAG 


CCATTGCATT 


TTACATCTTA 


4080 


GCAGCAATAT 


TAGGTTTGTT 


TTTAGCTGCT 


AACTCTTCAT 


TTTGGTTATT 


ACCAGTTGGA 


4140 




TTAGf ATGTA TGGCTGTTGG 


TTACCTATAT 


ACAGGTGGCC 


CTTTCCCTAT 


TTCATGGACG 


4200 


40 


CCTTTCGGTG 


AATTATTCTC 


AGGCX3TATTT 


ATGGGTATGT 


TTATTATCGT 


TATTGCATTC 


4260 




TTTATTCAAA 


CTGGCAATAT 


TCAAAGTTAT GTAATTTGGT 


TAAGTGTACC 


TATAGTAATC 


4320 




ACTATCGG 












4328 


45 


(2) INFORMATION FOR SEQ ID NO: 280: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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20 



25 



30 



35 



40 



45 



SO 



S5 



TAATTTCTTT 
TAACTGTCGT 
TATCTGGCGC 
CATCAGTTTT 
TTACGAAATC 
CATTCGGATG 
ATATCGTTGA 
TTAAACCTTT 
AGTGCGGTAA 
ACGGATCTCT 
GACTAGCAAT 
TAGGCATATC 
GTGGTGCGCC 
TTCTACTAAT 
TACATAATAT 
CACTTCGCTC 
CAAGTGGCGT 
ACTCAGATGC 
TCGACTCCTA 
TCTGGATCAG 
ATAACCATCG 
ACCGGTGCAC 
AATTCTTTTG 
TACGAATCAT 
TTTAGAATTT 
AAAATACTGT 
TCAACATTTT 
TTAGTCGATA 
ATAGAATCTT 



ACGCCCTTTT 
GTCTTCCATT 
TATCGGAAAG 
CTTTAACCAT 
CACATTTAAG 
ATCAAATTTT 
ATACGTTAGT 
TTTCTTATTT 
AATCTTCATC 
AAATGGCAAG 
TTGCATTTGA 
GAACTCATAA 
TACACTTCTT 
TTGGCTTTCA 
AGCGACAGGT 
ATCGGGGTGT 
TGAGCGTGAT 
AAATGTAAAA 
ATGCATTCAT 
AATCTTTAAC 
AACGAATTGC 
CATATAATCC 
GATAGCCACC 
CCTTCAGTX3G 
TAGGCGTCTT 
CAACAACAAA 
TTGTATCTTC 
GTATTTTATT 
GTGATTCTAA 



TTCTCTAAGC 
AATGACCTAA 
ACATCAATCT 
TGATTTAACT 
TCTAAGCCGC 
CTTAAATGAC 
ATTTGaTCAA 
AAAATGTGTC 
TCAGAAGTTA 
TTAAAATGAA 
TAGTAAATTG 
CTTACATAAT 
AATTCATGCG 
GCAATTGCAG 
CTTTCACTGC 
ATCCATGTTT 
CCCGGACTGA 
ACTTGCTTCG 
CATAGGTGTG 
AATGCCACAC 
AACAATAAAT 
TCGCGTACCA 
TAAAGCAGGT 
CGCCTTTATT 
ATCATAATGT 
TCGATGTTCG 
GTCCTCATCT 
ATTGACCTTT 
CAAGAATATA 



ATTGCCATTT 
AGAAATCATT 
TATCATTGTT 
TTTTAGAAAT 
TTCTAAACAG 
TTAAAGGATC 
CTTCTTGGTG 
TTAATGCCGA 
ACAATTCTGT 
TTGGCCCTTT 
CATCAATGGT 
TATTAAACAT 
GACGGTCACT 
GCGTATAATT 
CTTTAATTAA 
TAATATTTGG 
TAACTACTTC 
TTAAAGCTGC 
AACTTAAGGT 
CCAGCAAATA 
TCACAATCAT 
AATTCTTTCT 
GTTGGATGTA 
TCAGTGTACA 
AATTCAGTGA 
ATTAAGTTTT 
TGTGAACGTT 
ATTAATTGTT 
TAACTGTTTT 



TTCTAACCAA 
CGCAGAAATC 
TTGCACTAAT 
CACTGGTTTC 
CAAATCATAT 
AGCTAAAATA 
CTGCATATCC 
TGCATCTATA 
TGCATTCAAA 
ATGTGGTCCA 
CTCTTTACTA 
ATTTACTTGA 
TGTTAAAACG 
CGCTGCTGCT 
CCCAACTGCA 
ATGTGCTTCA 
CCTTACGCCG 
TTTATGATTT 
TCGTTTCTGC 
AAGTTGCTTG 
CGTATATATC 
GCTCAATAAA 
AATTATCAAT 
AGTGATATAA 
TATAAGGTTT 
TATTATCTTT 
TAATTGTACC 
CAGGTGTTTG 
TTTCTTTAGA 



CTTACGCGAT 

TCATATGAAA 

ATTTGAAATG 

CCAACACGAA 

GTACAGATAA 

GGCAAATCAT 

CCTACAATAA 

CTTTTTTGAT 

TCAGGTGTTA 

TATAAATATT 



TTAATCGCTT 
ATTAAAGGAA 
GTACCTGACG 
AAAAACGCTG 
AATGCAAGTG 
TACGCATATA 
CCCATTCATA 
CAATTCACTA 
TGCTTTCTTA 
TATATAGCCA 
ATCCATTGCA 
TAAACTAATA 
ATGATCATTT 
AATATCATGT 
TAAAAATGCT 
TGCTACAGCT 
TGAAAAGAAT 
ATATGCTTGC 



1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1 3 GO 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2860 
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10 



15 



30 



35 



50 



55 



TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CGTCCCATTT TCCCAACAGT 8340 

GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 84 00 

AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 8460 

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520 
TCTTTTCATC ATTCAC 

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

GCTTTGGCCA TTTTATGTGG CGATTGAGAC AATCtGTkGT TGTCTTATTT GATGTTGTAT 60 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CATTAACAAA 120 

TAATTCAATA CCTGCTATGT CTAATACTTT AGCACGTGCA TCATCAACAA GACGGCGTTC 180 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 240 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AATATCCAAA 300 

CAATGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 360 

ATCAAAATTC CACGTTTCAT CCATTGAAGA CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420 

AGTGATGACA TGATAGTTAT CAGTAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 480 

GCTAAGAAAT CCATGCAGAA ATACTAAAAC TTGATTGGTC TCAACGTTTG CTTCATAAAA 540 

TTTOTAATGT GTCATGAATC ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 600 

40 AGTTATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTTCAGAT AACAATGTGG 660 

CATTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAGCTG 720 

TATACTCGAA ATCCAATCCC GTCGGTGTGC CAAACAACCG TTCAAAATAG TCAGTTGCAC 780 

TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 84 0 

TCATCTGAAT ATTATTTAAT TTTGACATTA ATAGTCCATT CATATCATGA TAAAATGATA 900 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 960 

TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 1020 

AGTTATCTAC ATCTCTGATA GGCATACTAT TACTAATAAA TAATGCATCT TTTTCAGATG 1080 
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ATAATCAATT 
CGGATTATGT 
ACGTGCAGCT 
ACCAAGCGCT 
ACATTTCAAA 
CGCTTCAACA 
ATATTTCCCX3 
AATTTTAACT 
ATCATAAGTA 
CTTCATGTCT 
TTTTGCAATT 
AAAAAAACAC 
TAATTGCACG 
CCAACCACTG 
TAATAATTGT 
AACGATGAAA 
ACCAAAAGAT 
GACTTTTTTA 
AGAAATAGAT 
TGTGCCTAGA 

ttctSgtggt 
agatacaaag 

AGAGGCATAA 
TTGACCTAGC 
AAATTTCACA 
AAAGACAAAC 
TAACATGACA 
AGGATTTGAT 
AACTAATTGC 



TGCTGTAATG 
TCGGATTCAA 
AATTGTGCCC 
ATCATAATAT 
TGATGTCTAT 
ATAGTTTGAC 
AGTCTGTCTT 
TTATCATCAC 
ATCATAGATG 
CAATTTTAAT 
ATCTAAGTTT 
CTGAGCAACA 
ATAAAGACTA 
AATAATCTAA 
CCTACGACAA 
TCCATATTAT 
AAGGTTGCTG 
CGTTTTGTAT 
GACGTTAAAG 
TGCATTTGAC 
TCATAGCCAA 
ATATTCATAA 
CTCX5CATAAG 
GCAAACAAGA 
CCTTCTAAGA 
AGCAATGGCA 
ATAATCATCG 
ATTACCGCTT 
CATAATACTT 



CTGAAGTCAA 
ATTGATTCAT 
CATGCATACC 
TAATCTTTTC 
TCGCTGCAAT 
CTGCACGCAC 
TAACACTGCC 
TATAATGTTC 
CACCCTCATC 
TCTTACTTTT 
CGATTAATTC 
CAAATACTTG 
AAATAATAAC 
ATCGATCTTT 
GCGTAGTACC 
CAAAAATCGT 
GAATACTAAT 
TGTCATTCTT 
CCGCAAATAA 
TAAAGACCAT 
AACTATGTAA 
CAACGATAGA 
TAATCATACC 
TGCCATCAGC 
CGCCATCTAA 
TCATAACTTT 
TAGCGAATAT 
CAAAATTCAT 
GTCCGATGTA 



TTCGGGTCCC 
ATAAACGGCA 
TTCAGACTGA 
TTCTGAAAAA 
AGCTAACCCT 
ACGCCCTTCT 
TCCAGGATTC 
TAACAGTACT 
TGACATGCCG 
CAGATAAGAA 
AGAACCAGTA 
TGTGTCAGAT 
GACAGGCATC 
ACCAAAATAT 
TAATGCGCCT 
TCCCGCACCG 
GATAAATACT 
CGTGAAGTTA 
GAACAGCACT 
TGGCAGTACT 
AGCCGGAAAT 
AATAGCTGAT 
TGTAGTTCCT 
AGTAATCTCT 
TGTTAAAGAC 
CGATGCCTTT * 
GAATATACCT 
TTCTTGGAGA 
AATGACAATC 



AATGTATGAA 
CCATATTTTT 
CTCGTCCTTG 
CCATACGGCG 
ATGCCTGTAT 
TGAATTGCCT 
CATTGTTCAA 
AATGGTGTAT 
ATCAAATGAA 
TTATAAACGA 
CTAAATTTTC 
GCTTCTATAT 
GCATACTTAA 
TGTTGTAATA 
AATGGCATCA 
AATCTTACAT 
AAAATACTAC 
GAAACATTTA 
AAGAATCCCA 
TTAAATAATA 
ATAGCTAGAC 
GACTTAATCG 
AATGATAACG 
GATACTCTTG 
TTAATCACAA 
TCTAATCCTT 
TGCCCTAGAA 
TGATTGATGC 
CA^CCACCGA 



AATATGTATC 
CAGCATAGGA 
AAACTTCTGC 
CAAAGATCTT 
TACCAGCAGT 
TCTCTACTAA 
GCTTGGCATA 
TGCCaATTAA 
TGAAACCTTT 
CATTTTGTTA 
AATTCCAAAC 
ATTAACTAAA 
TTAAGTAATA 
ATTTTTTATC 
ATACATTCGA 
CTTTTAAGAT 
CGATCACTGC 
ATTCTAATAA 
AATAGAATAA 
ATCCAGGCCC 
CTGCCAATAC 
TCATGTCTTT 
TAAAGAATGA 
GTTGTAAAAT 
TGACGATTAA 
TTTCAACACC 
CGGTTAACCA 
GTTGAAATAT 
TAACACTATA 



6540 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 
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TAAGAGCATT 
AATTAAAAAA 
CAAATTAAAT 
TCGCACAAAA 
AGTGCTGCTA 
GTATCTTCAA 
ATATCTGCTG 
GCACCTAAAC 
TCTGTATTTT 
ACATCATGAT 
GCTTGTAACA 
AATGTTTTAA 
GAATTTGAAA 
GCAACGACAT 
CCTAAATCTA 
AAACCX3TGTT 
TTAGAAGGTG 
TCTGTATGCG 
AATGCGCGAT 
AACAGCATAA 
CCATTTTCTA 
GAATATTCAT 
TAAATTGGTG 
TTAGTTTTCT 
ATCACTACCA 
CAATGCACCT 
CAAACTTTTG 
ACGTCTCTCT 
AGCTGGCCCT 



ATATGTAAAA TTGCATATAT 
TAAGTAATTC ATGTGACAAT 
CCATAAAAAT GTCCACCAAT 
AAGCACTAGC ATATTCAAGA 
TTATTTATAA AGTATCTAGT 
TACCTACAGA AATTCTTACA 
GAATGGATGC ATGTGTCATC 
TTTCAGCTAA TGTGTAATAC 
tCACTTCAAA TGCAATCACA 
TTAAATGACT TTCAATACTT 
TTTTAATAAT TTCAATAACG 
TACCCCTCAC AAGTAAATAG 
TAAATGCTAA ACGTTCTGCA 
CACTATGTCC ACCTAAATAT 
ATGGATTCTG ATAATAAGGT 
CTTTCGCAAT TTCAGCAGAC 
TTTCAATAAA CAACATCTTT 
TTGTATCTAC AAAATCCACT 
AAGTACCGCC GTATACATCT 
CAACTGCACT GATTGCTGCA 
ATGTCGCAAT AACXJCTTTCT 
ATCCTTGACG TAAATCACCA 
TTGTAACGGC ACCTGTATAA 
TGTTCATTAT TATTCTCCTC 
TCTGGAAATA CGACAACAAT 
TGTAATGCTG CACCTGaAGA 
ACATTTCGAA AGGCATCTTG 
AAAAATATCG GCCATTTCTC 
CCATTTAACA CGGACCCTTC 



CXSTcAATACA 
GACGAATTGT 
CCTCCACAAC 
ACAACAAACG 
GCTTGTTTTA 
AGTCCGTCTG 
7ATGCAGGTA 
GATGTTGCTT 
CCTGTATGGC 
GGATGGAACA 
CTGCGATTAA 
CTATCTTGAG 
AGCTTGTCAT 
TTCGTTGCAG 
GTCATAAATG 
TTTTTAATGT 
GTTGTTGGGC 
TCAATGCCAA 
GAATTTAAAA 
ACACCTGAAC 
ACAGAACTTC 
ATATCATCTT 
TCGTCTGTTG 
ATAATTAAAT 
CGTACCTTCA 
ACTGCCTACT 
ATCTTTAATC 
AGAACCGATA 
TGGCTCAACG 



ATTTGCCGAA TTTTCTAAAA 474 0 

GAGACTACTA TGACATTTAT 4 800 

GCAATTACTA AATATTAACA 4 860 

TTGAACTCAA AATATATGCC 4 920 

AATCATCGAC TAAATCTTCA 4980 

TAATACCTTC TTTAGCTCGA 5040 

CTGAAATTAA ACTTTCCACT 5100 

TAATCAATTG TTTGGCACTT 5160 

CATCCGCTTG AGCCATATGG 5220 

CTTGTTGCAC AGCTGGATGT 5280 

TTTGTTCCAT ACGTAAACCT 534 0 

GTCCTAAAAT GCCACCTGTT 5400 

CCGATGTTGC AACTAAACCA 54 60 

AATGTAAGAC AATATCGATA 5520 

TGTTATCAAC AACTGAAATC 5580 

CAGTAACACG TAATAATGGA 564 0 

GTATCGCTTG TACAATTGAA 5700 

ATCGTGTAAA TACTTTTGTC 5760 

TAATATGATC TCCTTTGTCC 5820 

TAAATGCAAA GCCATGTTTG 5880 

TTGTTGGATT CX3CAGTACGA 594 0 

GTAAATATGT ACTTGTTTGA 6000 

TGTGCCCACC ATGAATTAAT 6060 

ATTTGCTTAG ACATATATCG 6120 

GATAATTGCG CTTTTAAATT 6180 

AACAACCCTT CATTTATAGC 6240 

GTAAATATCC CATCTACAAG 6300 

CCTTCAGTGT CATGTGCATG 6360 

GCATAACATT GCACGTGATG 6420 
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CTCTTTAACA GTTCCTGTTT CAACCACTTT CCCCTTTTCC ATTACAGCAA CACGATTACA 4 020 



AATGTCTTTA ATAACGCGCA TTTCATGTCT AATCATCATA ATTGTAATGC CAAAGGTTTG 
ATTGSCATTC TTTAATAACG TCAATATCGA AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 
40 TGCTTCATCG CATAGGAGTA TTTTCGGATT AGTAACAAGC GCTCTTGCAA TAGCCACCCT 
TTGCTTCTGC CCACCAGATA ATTCATCAGG AAATTGGTCT •rrTTTATCAC TCAATCCTAC 
AAATTCAAGC ATTTCCGTTA CTCGTTGCTT AATTTCTGTT T-PGCTTTTCT TACTTAAAAT 
GAGTGGCATT GCTACATTTT TAAATACGGT AGCTGAATTT AATAAATTGA AATCCTGAAA 
TATCATACCG ATATCTTTCT TAATATCCCT CATCATTTTA TCGCTATAAT TCGTAATATC 
ATGTCCATCT ACAATCACTT GTCCATTCGA GGCAGCTTCA AGATGATTCA CGAGTCTTAC 
CAACGTACTT TTTCCTGCAC CACTATATCC AATCACACCA AAAATATCAT TGCQATTGAC 
CGTAAATGAT ACGTCCTTCA AAGCATCTAT TTTTTGCTTC TTTTTATTAA AGGTCTTACT 



2940 
3000 
3060 
3120 



3240 
3300 
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AAGTGCTACG ATAACTAACC CAATCAATCT TTTCATTCTA TCAATTCCTT TCAAA^TCTT 
CACTATATAT CATTAATGTC TACGTATCAA TCTAGCTAGA ACATTCCCTA GCXSTTTGAAT 
S CACTTGGACA ATAATGACTA ATACAATAAC GGTAATAATA ATGACCGTCG TATCAAATCT 

TTGATAACCA TACACTAAAG CTAAGTCTCC TATACCACCA CCGCCAACAG CTCCTGCCAT 
CGCCGTACTT CCAATAAGTC CAATAATCGC AGTGGTAATT GCTAATACTA ACGAACCTAA 318O 
AGCTTCAGGA ATTAAAAAAT ATCTAATGAT TTGTAGTCGT GAAGCGCCCA TCGnTTTCGC 
CGCTTCAATA ATCCCCTCGT CTACTTCCAA TAATGAGTTT TCAACAAGTC TTGCAATGTA 
AGGTGCCACA TATACTGTTA AAGGCACGAT GGCAGCAGTC GTACCAATTG AAGTACCTAC 
TACTAATTTT GTGAATGGCA CAATCGCAAT TAACAAAATA ATAAATCGTA GTGACCITAA 342c 
AATATTGATT AAAGGATTTA AAACTTGATG TATCACTATA TTGGGCCATA TGCCTTGTrr 348C 
TCGAGTAATT ACCAATAAGA CACCTAATGG AATACCAATC ACTGCTCCTA AAAATAAAGC 3540 
AATAGATACC ATATATAGCG TTTCGTACAA TGCTTGTAAT AACTGTGCAC TGTCTAAATC 3S00 
AGAACCAAAC ATATGTTAAT GcACCTCCTC AAATTGAATA TTTTTCTCTT TGAAATATTC 3660 
ATITATTGCC GTGTCTTCAA ATTGTTGATC CATATTAAAT CGAAGCCACA TATAACATAC 3 720 

GGTGTTACCT TGTATTTCTG ACATAGATGA AAATAAAATT TTAACCTCTC TGCCACAAAT 3780 
TTGAATCAAG TCATTTATAA TCGGTTGTGT CACCTGAGTT TCCTCGACGA AGATTTTATA 3840 
ATCTTTAAAA TCGCCAACTT GTTCGTCATT CAATCGACGA ATCAATCATG TACrrTCGCTC 3900 
AGTCTGTATA ACTGTAGACA CAAAATTTTG AGCAATCGTC GTrTTAGGAT GACTAAACAC 
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GTATTGTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT 114 0 

ACTTTCAATT TTTGACCCXK; ATATATAAAG AAATTATTTA AGCCATTTAA GCTCATAATT 1200 

TTTTCATAAG TTGTACCATA TTTTGATGCA ATTAATGACA ATGAGTCACC TGCITGTACr 1260 

GTATAGTATG ATCCGCCACC TGAGTTCGTT GATGGACGGC TACTATTGCT CGCAGCGTTA 1320 

CTTCAGCTAG CAGTACCTGA TACTTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 1380 

AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ATTTTGATGC GATTAATGAT 144 0 

AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG 1500 

CTACTATTAC TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGGTT TGGGAAAATT 1560 

AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 1620 

GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 1680 

GTTGCCGCAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTGCAATTAC TTTTTTTTGC 174 0 

ACTTTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 1800 

CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA 1860 

25 CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 1920 

TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 1980 

ACAATCCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA 2040 

AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 2100 

CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCCTGCAA 2160 

CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC 2220 

CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 2280 
TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCGTT 2340 
ATTAXTAACT GCAATATCGA CATCAGATAA AGCACGTGCA GTTTGTTGTG CATCTACTGC 2400 
AGTAATTITT AAATGTTTTG GATTTGACXJT TATATCTTTC ACCGTGCCTG CTAATCCX3AA 2460 
ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 2520 
45 TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTTA ACCTTTTTGA CATCTTTAAT 2580 
TTTATCTGAG TAAATGCCCA ACGGTGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 264 0 

TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 2700 
^° AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 2760 
CACATCAATA TCATCTTTTT TAGCTAATTC TTTAACCTTC TCCCAAGCCT TAGTGTCATT 2820 
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CCJ^ACPXrtAK AAAATCCATT AACTGGAGPA ATTATTAGTA AAGGTGAATC GAAAGAAGAA 
ATCACAAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 
CATCGAGACG AATTTGATCC GAAGTTACCA ACAGGAGAGA AAGAGGAAGT TCCAGGTAAA 
CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 
AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTCaGGAA 
AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAC 
(2) INFORMATION FOR SEQ ID NO: 278: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 
TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 
CTTTTCGCGA TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 
GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 
TACAATTATG TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 
CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 
ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC ATAGTCGACG GCTACAATGG 
CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 
AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAGCAGT 
GGC^TTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT ACAAGCATTC GCAAAGTAAA 
40 ACATGAATAC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATGAAGTT CATTCGACCA 

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCCCGCCAT 660 
AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATG TAATAGGaAT TCATTATATA 720 
TTTATATACG TAAGACTTTA GTGAATATAT CTATAATTAT TTACTTGGTA AGCTGGTACC 780 
GTTCTGTAAG TTAAAATACC TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 84 0 

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900 
GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 960 
TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 1020 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 60 

GATACACCAA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC CGAAATCGTC AGTCACTCAT 120 

AACGCAGAGA CACCTAAGGT TAGAAAAGCT CGTTCTGTTG ATGAAGGCTC TTTTGATATT 180 

ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAAGAACAT 240 

TTTGAAGGTT ACGGAAGTGT TGATATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 300 

GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 360 

AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA TAACCATCAA 420 

TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 480 

TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 540 

ATTOATACTG GATACATTTA TACAAGTTCC ATGGACAAAA CTGAAAAGCA AGCTGGACAA 600 

GGTTATAGAG GATACGGAGC TTTTGTGAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 660 

GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 720 

ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 780 

35 TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 840 

TTAGGTTTAT CTAAAAATCA GGCATATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 900 

CTT;AtCAAG GGATAAATGC AAATGGCTGG ATGAGAACTG ACTTGAAAGG TTCAGAGTrr 960 

ACTTTTACAC CAGAAGCGCC AAAAACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 1020 

CCATTCAAGA AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAGTAACA 1080 
AGAGAAGGAC AAAAAGGTCA GAAGACAATA ACGACrCCAA CACTAAAAAA TCCATTAACT 
GGAGwAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATCC GATFAATGAA 

TTAACAGAAT ACGGACCAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 1260 
TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 
GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAGGAGAC 
TCGATTGTAG AAAAAGAAGA rATTCCATTC rAGAAAGAAC GTAAATTTAA TCCTOATTTA 
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TTCCTCCCTG TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AGAAAAATTG 


^ J.O 0 




AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTTTA TCTGAATTQG 


<5^4 0 


5 


CACCATTTCT ATATAAGACX3 GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 


"5 rs rt 




ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AAITTCGTCA 


J o o U 


10 


ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 


1 ^ o r* 




CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 


1 ^ o ri 




AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 


c >i n 


IS 


CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 


n n 
o o u U 




TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 


ooou 




TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 


O rt 

J 720 


20 


GTAAAGCCAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 


J /o 0 




CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 


^ o ^ n 
J o4 0 




CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 


3 90 0 


25 


GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTC 


1 o £ n 
J So 0 




ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 


>i r» n 


30 


ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 


yi rt Q /\ 
4 OoO 


TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 


** i*t\j 




ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 




35 


TAAGTCAAAA ATCAACATCC TAAAATCTAC AATGTTATAT TAACAATAGT TAACCAAAAG 


*x ^ D \J 


AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 






AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 




An 


TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAATACAC 


444 0 




ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACXSAAg 


4 500 




TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 


4560 


4S 


GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 


*t o ^ u 




ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATTSpa Bf"rT«rsrY->Kr' 


4680 




GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 


4740 


SO 


AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 


4800 




TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 
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TTACCGCAAG 
TTCATAAAAT 
TGAATTGGAT 
TTTGATTTCG 
CACCTGCAAC 
GCGCTGGCAT 
TTGACCAAGC 
TTGCATAAAA 
TCGTTCCAAG 
TCGCTCTAAA 
ACTTACCTTT 
ATATAGTTGA 
ACTGAATATT 
CTTAACACCT 
ACACTTGAAA 
TTAATATTTG 
ATAGGCTCGG 
AATGGCTCTA 
CGTTTTGTCA 
ACTTTACCAC 
TGCGTAATTA 

atttCatctg 

GGATCATTaG 
ATGGATAAGC 
GCTTAGCTCT 
CAGTCCTTGA 
CTCTTAATCC 
ATGTAGGCGC 
AACCAATGAC 



CTGCTAATAC 
GAAACCCCCA 
TATAAATACA 
AGTGAAACCT 
TGCTGTTGAA 
AGCTTCAGGT 
CGCTTCGATG 
CGGTGctGCG 
TATCAAACTT 
TAAGTTAACA 
AGCAGACAAG 
GACGACTGTC 
AGGCATTGTA 
CCATTTTAAC 
TATATGGAAT 
CTTCTAAAAT 
TTGTTGTTGA 
ATTCTGTTAA 
CAGTGTGTTG 
TTTCCATAAC 
GTACAATTGT 
TTGTTTGCGG 
TAACGCTCGT 
CTTTTCTCTA 
CCTTCTAGGG 
CCATAACAAA 
ATTTTTGGAC 
TTCTAAATGA 
GCCATAAATC 



AACTGCAAAT 
ATTTATCGTT 
ATAATTAAAA 
GTTAAGTATG 
CCAACTAAAG 
AAAAGGACTT 
ACACCTTTAT 
CCAATGATCA 
GTAAATGGAA 
ATGAAAGAAA 
AATAATAACA 
ATGTATAGTG 
ATCATTTCAT 
TTGTCGCTCA 
ATGCAGAACT 
ATTAATTTTA 
ACCAGCGAAA 
AGATGTTTCG 
CGGATTTTCA 
TGCAACTTCA 
TAAATTTTGT 
ATCAAGTGCA 
GCAATCCCAA 
CCTTTTAAAC 
ACACCTGCAA 
TTAAAATGTT 
AATTGACCTA 
TTAAACATTC 
GATCCTGCTC 



GTTAATACTA AAATAAGACC AAATAATTTT 
TATCAAGTTT ATTTGTAAGC CAATCCCCAA 
TAAAAACTGT TGATACTAAA ATGACATCAT 
CTAAATTTCC TAAACCACCG GCACCAATTA 
CGATTGCTGT AACTGTAATG CCAGACACTA 
TACGAATTAC TGTCCAAGTA TTAGCGCCCA 
CAATTTCTTT AAAAGCAATT TCTACGAGCC 
AGGCTGGTAA CGCACCTGTC GGACCACTTA 
TTAATAATAA AATTAAAATA ATAAATGGTA 
CGATAGAATA AAATAACCTT GCACCGATAG 
CACCTAAAAT AAGACCAAGT ATAAATGCAA 
TTTCGACTAT TGCAGTCCAA ACTTCTGGCC 
TTATAATTTC ACTAAATGAT TTACCCATGT 
ATTAACTCTT TTTCX3AATTT TCCGAAATCT 
AAAAAGCCGA CTGTTCCATT TTTTGTATTT 
ATATCATAGG CAGTTGATAG ACTCX3ATACA 
ACTAATCTAA CGATATATGC ATCTTTTTCT 
AAATCATCAT TTAAATCGTC TTTCACAAAT 
AAAACCTGTG TCACCGGTCC TTGTTCTATC 
TCACAAATAC GACGAATGAC ATGCATTTCA 
TGTTCTCTAA TTTTTAGTAG TAGATCTAAA 
CTTGTTGCCT CATCACAAAG CAAGACCGTT 
CACGTTGCTT TTGTCCACCT GATAACTCTG 
CGACGAGTTC GACAAGTTCT AATGCTTTTT 
TTTCAAGCGG AAACATAATA TTTTTTAACA 
GGAAGATCAT ACTTACTTTT TGTCTTTTTG 
TATGGTCTCC ATCTATAATA ACTTCACCTG 
GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 
GAATCGATAA ATTAACGTGA TCTACAGCAA 
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AAAGATGTCA TTAATCAATT GAGAGhGPiAA GGATATAAAG TATTTTTGGC AACAGGACGT 4 8oo 



TCGCATTCTG AAAATACATC AACTTG 
^ (2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 84 6 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



4826 



240 
300 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 
GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 
TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 120 
20 TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180 

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 
TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 
ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 3 60 

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 420 
TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 4 80 

GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 54 0 

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 
TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 
TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 



660 



720 



TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 840 

TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900 

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960 

45 CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 1020 

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 114 0 

GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTTA 1200 

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTrTTTCTAA AATTTCAGCA 1260 
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TACACGCGGG TCCTGAAATC GCAGTTGCAT CTACAAAAGC ATATACTGCA CAAATTGCAG 3000 

TATTATCAAT CTTGTCTCAA ATCGTTGCAA AAGAGCaTGG TCGTGAAGCA GATATTGATT 3060 

TATTCAGAGA ATTAGCAAAA GTAACAACAG CAATAGAAGC AATTGTTGAC GATGCACCAA 3120 

TTATGGAACA AATTCCTACA GATTTCTTAG AAACAACACG CAATGCATTC TTTATCGGAC 3180 

GTACTATTGA CTATAACGTA AGTITAGAAG GTGCXSTTAAA ACTTAAAGAA ATTTCTTACA 324 0 

TTCaAGCAGA AGGTrrTCCT GGTGGAGAAC TTAAACATGG TACAATTGCC TTAATCGAAG 33 00 

AAGGTACACC AGTTGTAGGT TTAGCAACAC AAGAGAAAGT TAATTTATCA ATTCGTGGTA 3360 

ACGTTAAAGA GGTAGTAGCA CGTGGTGCAC ATCCATGTAT TATTTCTATG GAGGGTCTTG 3420 

AAAAACSAAGG CGACACTTAT GTCATTCCTC ATCTACATGA ATTGTTAACG CCATTAGTAT 34 80 

CAGTGGTTGC ATTACAATTA ATTTCATACT ATGCAGCATT ACACAGAGAT TTAGATGTTG 3540 

ATAAACCACG TAACCTTGCT AAATCAGTTA CTGTGGAATA ATTCACTTTT TTAGAATCAA 36 00 

TCATCTATTA AAATTAAAGT ATATCGCACC CmTAGATT AATCGACTAG AAGGGTGCTT 3660 

-rrrrAGGTCG ACrraGCTTT TACTTCATCT TAATTTGGCA GAAATGCGTa AAAATGAAGT 3720 

GTTTTATTTA TTTAAATAGT CTGACAATTA AGGGTGTTAT GTTAATATGA TrTTATGAGA 37 80 

AGTATGGAGT AGCAATAAAG GGGTGACCTC GCATGTTAAT TCAATTAGAT CAAATTGGGC 3 84 0 

GAATGAAGCA AGGAAAAACA ATTTTAAAAA AGATTTCTTG GCAAATTGCT AAAGGTGATA 3 900 

30 AATGGATATT ATATCGGTTG AATGGTGCTG GCAAGACAAC ACTTCTAAAT ATTTTAAATG 396 0 

CGTATGAGCC TGCAACATCT GGAACTGTTA ACCTTTTCGG TAAAATGCCA GGCAAGGTAG 4020 
GGTATTCTGC AGAGACTCTA CGACAACATA TAGGTTTTGT ATCTCATAGT TTACTGGAAA 
AGTTTCAAGA GGGTGAAAGA GTAATCGATG TGGTGATAAG CGGTGCCTTT AAATCAATTG 
GTGTOTATCA AGATATTGAT GATGAGATAC GTAATGAAGC ACATCAATTA CTTAAATTAG 

TTOGAATGTC TCCTAAAGCG CAACAATATA TTGGTTATTT ATCTACCGGT GAAAAACAAC 4260 

GAGTCATGAT TGCACGAGCT TTAATGGGGC AACCCCAGGT TTTAATTTTA GATGAGCCAG 4320 

CAGCTGGTTT AGACTTTATT GCACGAGAAT OGTTGTTAAG TATACTTGAC TCATTGTCAG 4380 

ATTCATATCC AACGCTTGCG ATGATTTATC TGACGCACTT TATTGAAGAA ATAACTGCTA 4440 

ACTTTTCCAA AATTTTACTG CTAAAAGATG GCCAAAGTAT TCAACAAGGC GCTGTAGAAG 4500 

ACATATTAAC TTCTCAAAAC ATGTCACGAT TTTTCCAGAA AAATGTAGCA GTTCAAAGAT 4560 

GGAATAATCG ATTTrCTATG GCAATGTTAG AGTAAATATT TTGCAAATAA TAAGTAATAA 4620 

TGACAAAATT TAATTAAGAT AAAATGGACA GTGGAGGGCA ATATGGATAA CGTAAAAGCA 46 80 
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GAGAGGTTAC TTGTTACTCA ATATAAACAA AAATC!AACTT TGTCAAAATA AATGTGACAA 

AATTAAATAA AGTGTCATCA ATGTGACAGT ATAGATATTT TGAAAAAGTA AAACAAAAAA 

s 

ATTGTTTTAG GATTTTTAAA ATTTTATTGT GAAAATATTT GCAAAACAAA ACAACACCGT 

gTACAATAAT GATTAATGGA AAGGGGGAAA GTTCGGCAGT ACAGTTAAAG CGCCTGTGCA 

AATAAATATT TGTATTTGAA GATTAAAGGT TAATATATGA GTGGCCTTTA TAGAGTCCAA 1440 

TATATGTATT TGTAGACGAG GAGGATAGTG ATCGAATAGA TCGGCGGATG CTATCCCGGA 1500 

TGTGGCTCAT TCGTTAGCTT ATTAAGTAAA ACATTAGGGT GACTTAATGG ACAAAGTTAA iseo 

TAAGATCGCC AGAAATTGAA TATAAAAAAT ATTAATATGG AAAGTACAGT GTGAGCAATT 1620 

TGTATAGTTG TAAAAATAAC TATGCTTAAT TTGTTATGGA TGAATGCGAT GATAGCATGT 1680 

TCCTATTTAT ATTATGAAAG CAGATTGTCA ATCTAAATTA TCX5GCAATAA ATCATAATTT 174 0 

20 ACGCGTACTA TTCCAATATG GAGGAAAATG TCGTTATGTG TGGAATTGTT GGTTATAT7X3 1800 

GCTATGaTAA TGCCAAAGAA TTATTATTAA AAGCrTTAGA AAAATTAGAA TACAGAGGTT i860 

ATGACTCTGC AGGTATCGCA GTAGTAAATG ATGATAATAC AACTGTATTT AAAGAAAAAG 1920 

25 GTCGTATTGC AGAATTACGT AAAGTTGCTG ATAGTAGCGA TTTTGATGGA CCTGTTGGAA 1980 

TCGGTCACAC ACGTTGGGCA ACACACGGTG TACCGAATCA TGAAAACTCT CATCCACATC 204 0 

AATCATCAAA TGGCCGTTTT ACTCTAGTTC ATAACGGTGT TATTGAAAAC TATGAAGAGT 2100 

30 

TAAAAGGTGA ATACTTACAA GGTGTATCAT TCATTTCAGA AAC7VGATACA GAAGTTATCG 2160 

TTCAATTAGT TGAATACTTT TCAAATCAAG GACTTTCAAC TGAAGAAGCA nTACAAAAG 2220 

^ TTGTGTCATT ATTACATGGT TCATATGCAT TAGGTTTATT AGATGCTGAA GACAAAGACA 2280 

CAATCTATGT TGCTAAAAAT AAATCACCAT TATTATTAGG TGTTGGTCAA GGTTTCAATC 2340 

TTATCGCATC AGACGCACTT GCAATGTTAC AAGTGACAAG CGAATATAAA GAAATCCATG 2400 

ACCATGAAAT CGTTArTCTT AAAAAAGATG AAGTTATTAT TAAAGATGCA GATCGAAACG 2460 

TTGTAGAACG TGATTCATAT ATTGCTGAAA TTGATGCATC AGATGCTGAA AAAGGTGTTT 2520 
ATGCACACTA CATGTTAAAA GAAATTCATG AACAACCAGC AGTAATGCGT CGTATTATTC 
« AAGAATATCA AGATGCAGAA GGTAACTTGA AAATTGATCA AGACATCATC AATGATGTTA 
AAGAAGCAGA CCGCATTTAC GTTATTGCAG CAGGTACAAG CTACCATGCA GGTTTAGTAG 
GTAAAGAATT TTTAGAAAAA TGGGCTGGCG TACCAACTGA AGTACACGTT GCATCAGAGT 
so TTGTCTACAA CATGCCATTA TTATCTGAAA AACCATTGTT CGTTTATATT TCTCAATCAG 
GTGAAACTGC AGATAGCCGC GCCGTATTAG TTGAAACTAA TAAAITAGGT CATAAATCAT 
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TACCAAATTT ACTTGAAAGT GCACSATATTG TGATTAGTTC AACGAGTGCA CAATCTTATA 1260 

TCATTACAAA TGAAATGATA GAAAGAATTG CAGAAAATAG AAAGCAAGAT TCACTAGTAT 1320 

TGATTGATAT TGCAGTTCCT CXSAGATATTG AACCTGGTAT TAGTGCCATC ACAAACATCT 1380 

TTAATTATGA TCTTGATGAC TTAAAAGGTT TAGTTGATGC AAACTTACGT GAGCGACAAT 1440 
TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT ACATGCACAC AATGAG 
(2) INFORMATION FOR SEQ ID NO: 275: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4826 base pairs 

(B) TYPE: nucleic acid 

15 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1496 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

CTTGATTTTT TCCCtTTAGT ATTTTCCaTt TGanTGTCGC AGCTTCTAAA TCCTGCTTTG 60 

GTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA" TGATACAAGT GCAGCAAGGA 120 

25 AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAGGTGC ATTTAAACAG TAAACTATAA 180 

ATGAACCTGG TGACGCGGGA CTTTTAAATC CAAATCCTGT TGCTTGATAA GTTGCAACAC 240 

CAGTCATTCC ACCTAAAATA ACAGCGATAA ATAATAAAGG ACGCATTAAT ACATATGGGA 300 

^ AATAAATTTC ATGAATACCA CCTAAGAAGT GGATAATTCC AGCACCATAT GACGTTGCTT 360 

TTGCAGTGCC TTTTCCAAAA ATCATATAAG CAAGTAAGAT ACCTAAACCT GGTCCAGGGT 420 

TAGATTCAAT TGTGTATAAA ATTGATTGAC CAGCTTTTGC AGCTTGATCT GCACCAAGCG 480 

GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC AATTTTTGCA GGCTCTACTA 54 0 

AAATACTTAC AAGTGGAAGT AGGTGTGCAT GTACTAATGC TTCAACTGCC ACTGATAAAA 600 

TATGCATAAT AAATTTCATA AGTGGTGCTA AAATTTTAAA TCCTGCAATC GTCATGATAA 660 

ATCCTAAAAT ACCAGCAGAA AAGTTATTAA ATAACATTTC AAAACCTTGC GGCGTTCTAG 720 

GTTGAATCAA TTGGTCGGTC TTCTTCATTA ACCAACCAAC AAGTGGACCC ATAATCATTG 780 

CACCAAGTAA CATTGGTGTA TCAGGTAATG CAACGATGAC CCCCATAGTT GCTGTTGCTG 84 0 

CGATGATACC ACCACGTAAA TCATAAATTA AACGACCACC ACTAAATGCG ATCAATAATG 900 

GGATTAAATA AGTAATCATT GGTCCTGCTA AAGTAGCTAA ATCTTTGTTA GGTAACCATC 960 

50 CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT GAAAGCGCCA ATGTTTGGCA 1020 

TGATCATACT ACTTAAGAAT GATCCAAATG CTTGAACACG ACGACCAATT CCTTTTTTCT 1080 
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CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCXSATrr 
TQATTCGCCA TGTCTAACTA AATAAATCGT CATAATATTA CTCCTTACCT TATGTATrTC 
ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TAITCTAAAA 
AAAGAATTAA TTCATATnT 

(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
20 GAGAGAATnT GCAATTAGTT ATTCAATTAG TTGATTTAAG ACATGATCCA ACACAAGATG 

ATATCTTAAT GTACAATTAT TTGAAACATT TTGATATTCC TACTTTAGTT ATATGCACTA 
ArGAaGACAA AATTCCaAAA GGTAAGGTyC AAAAGCATAT TAAAAATATT AAGACACAAT 
25 TAGATATGGA CCCAGACGAT ACAATTGTAA GTTATTCATC AATTCAAAAT AATAmaCAAC 

AACAAATATG GAATTTAATT GAACCGTATA TTrCATAGTT TTTGTACGTC AAAACTTATA 
CAAAAATTTT AAAAATAATG TAAGCACGAA ACTTTTAATT AGTACACAAT TGATAACATT 

•rrrCAACGTT CATCATTTTG tcaaaaactc aaaagtaaat tagaaagatt ataatttatt 

TAAGCATCGT ACTTAATTGG ATTTTAAATT ATGTTATAAT ATTTGTATTG TTAGTATATA 
TGGGGGCTTT TCAAATGCAT TTTATTGCAA TTAGTATAAA TCATCGCACA GCTGATGTgC 
ACTAAGAGAG CAAGTTACTT TTAGAGATGA TGCCTTACGA AITCCCCATC AAGATTTATA 
TGAAACTAAA TCTAyTTTAG AAAATGgTCA TATTaTCAAC ATCTAATCGA ACTGAAGTAT 
ATGCTGTTGT TGATCAAATT CACACAGGTC GTTACTATAT TCAACGATTT CTAGCTCGTG 
CATTTGGATT TGAAGTAGAT GATATTAAAG CAATGTCAGA AGTAAAAGTG GGGGACGAAG 
CaGTAGAACA TTTATTGCGT GTCACTTCTG GTTTAGATTC AATCGTACTT GGAGAAACTC 
45 AAATTTTAGG TCAAATAAGA GATGCATTTT TCTTAGCGCA AAGCACAGGT ACGACAGGrA 
CAATTTTTAA TCATCTATTT AAACAGGCAA TTACTTTTGC AAAAAGAGCA CATAATGAAA 
CAGATATAGC TGATAATGCT GTAAGTGTGT CTTATGCTGC GGTCGAGTTG GCGAAAAAAG 
TATTTGGCAA ATTGAAAAGT AAGCAAGCTA TCATTATTGG TGCAGGGGAA ATGAGTGAAT 
TATCACTATT AAATCTTCTT GGTTCTGGAA TTACTGaTAT TACAGTAGTA AATAGAACAA 
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ATATTTAATC ATACTGTATG TTCAATGGGC ACTCTAGTAA 
ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT 
GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG 
GCATCCTTTT TATGATTTTC AAATTCAAAA AAATGAGCAA 
TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT 
TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT 
TATATGTCAA AGCGGCCAGG GACAGACCAA ATAAATTCTG 
AATTCAATAT CTAATGCTGC TGTAACAACT ATAAAATCTA 
GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA 
TTAAAATGTA GTTGGTGCTG AAATAAAGCT TGTTC 
(2) INFORMATION FOR SEQ ID NO: 273: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



TAAGTGTTCA 
AACCACAAAA 
ATGTCACTTT 
AATGAATCTC 
GTGTGATGCT 
GAATTAATAT 
TTGTAACCAG 
TCATTTGTTG 
ATGATGACTG 



TATAACAAAA 
GATTTTTTGC 
GCTTAACTTG 
TTTACCAGTT 
GGAATGGATA 
TTGAGGTCCG 
TGAACGTAAT 
ACGTTTAGGC 
AACTTCCGTT 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
TITTGAACAG CCATATITAT TCACCCTCAA CATCATTATA ATGGTATTAG TCGCATTACC 
TTCACTTGTT TTAGCTATAT ATGATTATAT GAGTTTTAGA ATTTCTTCTG CTATTTTACA 
ATTTCTAGGG GCTATCTCTT GGTTCTTTTT ATCATTGATA TTATCGCTCA CACAATTTAC 
ACCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG TTCACAAGCA CAATTATCAC 
ATT^CCATT GGTGGTAAGT CTGTTGAAAA GAATGATTCC CCTTAAATTC CAAATGAAAA 
AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT TCTATTCAAT 
GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG AAACTTTGTT 
AAAATAAACC ATCAAACAAC GAATCACAAC TTGATGTGCA ACAATGACAA TATCATCTTT 
TTCTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT ATACATCTTC 
ATAACTTTCT CCTTCAGGCG CTTTTTGTGA AAAACTATGA CGAAAGTCTT TAAAGTTTGG 
ATCATTGAAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT ATTCACCCTC 
AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGTAGTC GTTGATGCAA TGTCATATGG 
AAAAATATGT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA CATGTTTAAT 
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AATGATCATT TTCGAATTTT TGAACATAGA AAATATCCTT GTCTTCGTTG TTAAAAATAG " 373 

CGCGGAATGT ACCACTGATA TCAGTAATTG GTTGTGTATG CTCAGATGAA GTAATAGGAA 373 

TGGCATGTAG AGGTAAGTCT CCAAAGCCAA CATCAGTTAC ATAGAATACA TCATTTATAG 334 

AAACAACAAG TGAAGCATGT GAACCGTTCA GACTACGACC GCCACCGGGw GTCTGAATAG 390 

TAGCTGACAT TAATTCAGGA TTAAATCCTT TTTGTTGTAA ATAGGCTTTG AAAAATCTAT 396: 

TTAATTCATA ACAAAAACCA CCACGTTTAT CATGAACAAT TTTATTAAAA AGTCCATCGA 402. 

TATTTATAGA TATCGGCTTA CTATTTTGAA CATCAATATT TTCAAAAGGT ACAGTTAACA 40e( 

TAAAACGTGT TGCATAATAA TTTAATGCTT CAATACTCGG TCGATTATAA CGAGATCAAT 414( 

CAATTTGTAA ATAATTCTCT AACTTCGCAA TATTCATAAG CATAGCGCCT CCTCTATTAA 420C 

AGATTATAAT TAAATTTTAA ACAGAAATAC TGAAATTTTA AATTCGAAAG CATTCAATTT 426C 

20 TGGATAAATA CATTTTAAAT AGAAAAATAC GCTCTCAAAA TGAAGTCATC TCTAAAAGAA 432C 

ACGATTTAAA GATGACTACT GAGAGCGTAg CATAATGGAA GAAGTGTGCA GGGTGTCTAA 4380 

AAATGCAACA ATACAAAGGT AGTTGCAAGA CAAGTTGCCT TATCTAGACC AITTGTGTTC 4440 

TATGCGACCA AACTTCCAAA TTAAACTTGA AATAAGCCAA GTAATTAAAA ATAATGCAAC 4500 

TAAAATATAG CCTAAATAAT CAAATTCGAT CGAACCAATG AATGCCCAAA ACGCACCATG 4560 

TAAATCTAAC TTATCAGCAA GAATTTGTAG CAATTCAATC ATCCCAATCA CTAATSCTGC 4 620 

CATGACTGAT ATCGCAGTAA TCGTTATATT GTAATAGATT TTCCGAATAG GATTCAAGAA 4680 

TGCCCAATTA TAGGCATACT TCATTACAAC ACCATCTAAT GTATCCAATA AACTCATACC 4740 

TGATGCGAAT AAAATTGGTA AAGATAAGAT TCCGATAAAT GAAATGGCTT GTTSTCATGC 4 800 

GCCTGMGPA AGAGCGAGTA ACGCAATTTC ACTAGCTGTA TCAAAACCAA GTCCAAATAA 4860 

AAAGCCAAGT GGCAATACGT GCCAACTACG CGTGATTAAT TTGAAATAAG GTCCTACAAA 4920 

^ TCGAGAAACC AATCCTCTAG ATTCAAGTAA TGCATCGACT TCAGCTTCTT CAATCTCTTC 4980 

ACGACGTAAT TTAGCGAACA AGTTAATTAA AGAGATTAAA ATAATTAGAT TCAACACACC 5040 

GATAAGCACT AAAAAGAAAC CTGAAACTAG TGTACCAATC GTTCCACCAA TATCTTCGAA 5100 

45 ATGCGGTAAT TCATCTTTAG CCCATTTTAC AGATACCCCT AAAAAAACAG CCATTAAAAA 5160 

TACGACAGAT GAATGTCCAA TTGAAAAATA GAAACCCACA CCAGATGGAT CTTTGCGTTG 5220 

CTGTAATAAT TTGCGaACCG TATTATCTAT TGCAGCAATG TGATCTGCAT CAAATCCATG 5280 

ACGCAAACCT AATGTATATG CAAGAATCCC CaTACCAAAT AAGATATGAT GGTCTTPTCC 5340 

AGCAATCCAT AAAAAACTAA ACCCAATAAC GTGTAACAAA ATGACAATAG CTATCTATCG 5400 



25 



30 



SO 



SS 



MSOOCID: <eP_0786519A2.L> 



1086 



15 



20 



40 



45 



EP 0 786 519 A2 

ACCATGACAA TAGTTGTATC AAACGATTTA TTCATTTCTT CCAAACGTTG TAATAGGTCA 1920 

TTTGCACTrr TCGAGTCGAG TCCGCCTGTT GGCTCATCTG CAAATATGAT TTGTGGTTTG 1980 

^ TGAACAAATG CTCTCXSCTGC TGCAGTTCTT TGTTGTTGAC CACCAGATAA TTCGCTAGGG 2040 

TATTTATrrC CTAGGTCATA AATACCTAAT GCTGTCGTGA TCGCTTTATA ATTTTCTTCC 2100 

ATTCTTGCCT TCGACATTTT TTGAACAGAT AAAGGTAACA TAATGTTTTC TTTAACGGTT 2160 

AATGTCGGCA GAATACTGTA ATCTTGGAAG ATCAAACCTA ATGATTCTTT GCGGAATTTG 2220 

GCAAGTGCTT TTTGATTAAG TrrATTAAGC TCTTGTCCGT TAGCAATCcg cTACCGCTAG 2280 

AAATTTGGTC AATTGAACTT AGTACATTTA ATAAGGTTGT CTTACCTGAT CCAGAAGGCC 2340 

CCATAATCGC AACGAATTCXJ CCTTTTTCTA TGTCAAAGTT AATATCTTTA AGTGCTTGAA 2400 

ATGTGTGCTT TTTACCGTAT GTTTTTGAAA CATGTgCACT GATAATATCG TCATAGTCTC 2460 

ACTCCrrtTG TATTTAATrr CATTTTAAAT AATGTTTGGA GTAGTAGCCT TTATCTAAAC 2520 

TTACAATTCA ATGAATGAAC CTTACAGAGT TGAAArcTAT CGCTACTTAG TAGATTTTTG 2580 

AGTCAGGATA CAGATTCATC GTACATATTA GACAAAAGCA ATGGTGCTTT CTAAGTGATG 2640 
25 ATCTTTGTGT AAATTGAGAA AAGGGAATTT AATTATTGTA TAATAAATTT TTTGTAAAAA 2700 
TTAAAAGAGG GTTTTATTTG AAAGGAATTG ATTGTTATGG PJ>J>JJKGGPJ^ TCAAGGTATT 2760 
AAATGGTCTA GTTTAATAAT GGGTGTATTA TTATTAATGT TGGCAGTCGT TATTTTTACA 2820 
^° TrrCCAATXG AAAATTTTTA TGCTATTACC TGGTTGATTG GACTGTTTGT ATTAATTAAC 2880 
GGTGTGATTC AAATCGTTTA CCGTAGAAAA GCAAAAGCTT TAGTAGGTGG TAACCAAAAT 2940 
TCGATTCTGT TTATGGGGAT TGTAGATATT CTATTTGGTC TATTAGTTAT TTTTAATGTT 3000 
GGCGCAAGTT CAGCATTCTT TATTTATATG TTTGCTTTTT GGTTTATTTT TAGTTGTATC 3060 
TCTGGATTAT TTACGTTTTC GGGTAGTGGT AGCTTAAAAC TAATTTCAGT GAnTTTAAT 
TTATTAGGTA TTCTTTTCGG TGTCATTTTA TTATTTAATC CATTAATGGG TATCGTCTTT 
ATTTCGACGA TGATTGCTAT TGCATTTGTA TTCGTAGGTG TCATTTATGT TGTAGATGCA 3240 
CTTGCTTAAG TAAAATCAAG CGGTTCAAAA GAAGGGTGTG ACATGAAGTT TGTGTCATAT 3300 
CCTTTTTGTT GTGTTTATGA AGCATAAAAA AGGGGCGCTA CCTACAATAA GTAAGATACG 
CCCATATTTT TATATTTTAC TATTATTGTT TTTCAATACG ATTAATAGTT ACATTTAGTC 
CAAAATATTT TTCTAAAAAA TGTTTATAGT TATCTTTAGT GACATCAAAT TTTTCTGAGC 34 80 

SO TACCATTCCT TGTTAAAGTT AAATGATTTT CAGACATTGT AGCACGGCCA AATGATTGTG 3 540 

GCATTGTAAT TAATAAATGC TGTACAAATA TTGAATCTGG ATGCGTTTGA TTATATTCGA 3600 
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CGTAAAcCTA 
TGTTTAATAT 
ACAAATATTA 
ACTTTGTCTC 
ATATCATATC 
GGACTTACTT 
TCTTCTTGTG 
AATGGATATT 
TtAACTTGTT 
AAGTAATAAC 
ATGTAATTTG 
CGTTAAATTC 
CCGCAAAACA 
TGGCATTTTT 
CX3CGTCCATT 
ATAAATAGGC 
TTAGTGCTTT 
CAATACCTAA 
TTTTAATTGA 
GAAATAAAAT 
GTTOSAAATG 
ATAaTTGTGC 
TCATTtGTTC 
ATAGCGCAAA 
TGACAATAAT 
TCGAGTCGTC 
CGATGCTAAA 
TTTTGAAAAC 
ACAATGTCTT 



TGCGTTTTAA 
AAATAATACA 
GAATACCGTT 
CAAACTGTTT 
CATAAAAAGT 
CGATGACAGG 
TAcACGGAAA 
AATAATGTTA 
ATTTGCACTT 
CTGATCATTA 
TTGTTGGCTT 



ATTTGGTGCC 
CAATACAGTT 
CTTCATTCTG 
TTTTGATTTT 
GCCTACAACC 
GAACGTACCA 
TACGCCTGAA 
ATCTTTCATC 
CAGTACATAC 
TATCGATAAG 
ACCTGCAATG 
AAGTGCTAAC 
TTCACX5TGTA 
GATAAATAAA 
ATTATTTACA 
AAATAATGAA 
GATATGGTTA 
CATAAAAGGC 



TATTCTGAAG 
TCCAGCTGCT 
AGTAGCATCG 
TGCAATTGCT 
ATGAACGTTA 
TGAGTTGAAG 
GTTTCATTCT 
GGCAGCGATT 
AAAATAGAAT 
ACATTATCTA 
AGTTTAGTTT 
ATAGATGTAA 
ACTGTAACTG 
TACATAATAG 
TTCAATGTTT 
GTTAAAAATA 
AACATTTCTG 
ATAACCTCTG 
ATTGATAAGA 
GCAATAATTA 
TTAATCGATA 
CCACATAATA 
ATTTnCAAAA 
CX3GCGTTTTA 
AATATTGATC 
CCTTTAGTAA 
AATAAATACA 
AATGTCATTT 
CTGTTTAGAA 



TTACTTAGTT 
ACTAAAAATG 
ACCTCTTTTT 
TGAGCTTTTT 
TTTTGTGTTT 
CTTAGATTTA 
TAGTACCTTT 
TTGTATTTGT 
TCX^GCXSACC 
CAGTAATAGT 
CAAATTGTTT 
GGGTTTGATC 
CAGA7UVTAAT 
ACGATGTGAA 
TAAAAATAAG 
AAATGATAAA 
TCGCCATATA 
CAGTCGTTAC 
TACTACGTCT 
GCATGAAAAT 
ACGACATCAA 
CACCGACAAC 
TGTTTTGTCG 
CXSrmTAAATG 
CfGACTAAAGC 
ACTGTAAGGT 
TTGCATAATG 
GAGACACCAC 
CGTCCTTCCT 



CATCCTCAGT 
CTAATCCTAA 
TCATTTCATC 
CCTTTTGTGA 
TCAACTGCTG 
AAGGATAAAC 
TACTACTAAA 
AATGATGGCA 
TGAATCACTA 
TTCGTAAGCA 
AGCATCTTGA 
TGTATTTGAT 
TGCAATGATA 
TACT^CATCG 
TGACACGGAA 
CX3GTGATGTC 
ATAGCCTAGT 
TTTAGCAGTT 
TTTTAGAAAT 
AGTTAAAACA 
TTTAGATACT 
ACCTGTGATT 
TGTTAAACC7V 
ATTGGCATAC 
ACCTTTCTTA 
TGTAAAACTG 
TTTTAAGTTT 
CTAATACTGA 
GATAAAGTTG 



TTCATCCATT 
AAATGATGTA 
ATACGTGATG 
TGTTTGTTTA 
ATACTTTTCA 
CTTACCTTTG 
TCTTTGTTTA 
TTGTTGCCTG 
CCATTTTCCA 
TTTTTAGAAA 
GTAGCGACCA 
TTAGATAATG 
GTTAAAGACA 
GTAATAGATA 
CTTCTGAAGA 
ATAGCCATAG 
GCAATCATAG 
GTGGCATCAG 
AAAGCACTTT 
AGGGCCATAG 
ATGGAAAGCA 
AAAAATACGA 
ATCaACTGAA 
ATTAAAAAGA 
ATGATGGCCA 
AAATATAAGA 
TGTCGTAAGT 
TTGAAGATGT 
TGTATGAATT 
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1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
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ATAAGCAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 60 
AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAATATTATG 120 
TTGGTAAAGG TAAAATTCAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 180 
TCATCACAAA TGATGAATTA ACGACTGCAC AATCCAAATC ACTAAATGAA GCTTTAGGTG 240 
TAAAAATTAT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 300 
AAGAAGGTAA ATTGCAAGTA GAGCTAGCAC AACTTGATTA TTTATTACCT AGATTGCAAG 360 
GCCATGGTAA AAGCCrTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 420 
CGAAGTTAGA GATGGATCGC AGACATATTC GAACTCGTAT GAATGAAATT AAACATCAAT 480 
TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCGAAATAA AAGAAATCAA AATCAGGTGT 54 0 

TTCAAGTAGC TTTAGTIGGT TATACAAATG CTGGTAAATC ATCATGGTTT AATGTTTTAG 600 
CAAATCAAGA GACGTATCAA AAAGATCAAT TATTTGCAAC GTTAGATCCT AAAACACGAC 
AAATTCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT TTTATACAGA 
AACTACCTAC GACGTTAATT GCAGCTTTTA AATCAACTTT AGAAGAGGCT AAAGGTGCAG 
25 ArrrATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CCGTACGCAG TATGACACAG 

TTAATGATTT AATCAAACAA TTAGATATGA GTCATATTTC TCAAATAGTT ATTTTTAATA 
AAAAGGACTT ATGTGATCAT GCATCAAATC GTCCAGCAAG TGATTTGCCT AATGTTTrTG 
TTTCTTCTAA AAATGATGGT GATAAATTAC TTGTTAAGAC GTTATTTATT GATOAAATCA 1020 
AAAGGCAATT AACTTATTAT GATCAGACAA TTGCGACGAA TAATGCAGAT CGATTATATT 1080 
TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 
GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA 
TAGT^GCTGA CGTCGAATCA ACGTTAGCAC CATATTTTAA AGAAATTGAA GAAACAGCAT 1260 
ATATTAATCA AGAAAAAGTA TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 
TACAAGGATC AACAGGATAC GGGTATCATG ACTTTGGACG TGATCATTTA G 
(2) INFORMATION FOR SEQ ID NO: 272: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 



660 
720 
780 
840 
900 
960 



1140 
1200 



1320 
1371 
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IS 



25 
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ACCTAAGAGG 


TGTGGATATG 


AATAAACACA 


AGAAAGGTTC 


TATTTTTGGA 


ATAATAGGAC 


900 


TTGTTGTCAT 


ATTTGCTGTT 


GTCyCaTTTT TATTTTTCTC AATGATATCC GATCAGATAT 


960 


TTTTCAAACA 


TGTTAAATCC 


GACATTAAGA 


TTGAAAAGTT 


AAATGTTACA 


TTAAACGATG 


1020 


CAGCAAAGAA 


ACAAATAAAT 


AATTATACGA 


GTCAACAGGT 


^ ^1^^^ Ik Ik m rn m ^ 

ATCAAATAAA 


AAGAATGATG 


1080 


CATGGAGAGA 


TGCATCTGCA 


ACTGAAATTA AAAGTGCAAT 


GGATAG CGGT 


Ac i I'i. J. ATCG 


1140 


ATAATGAAAA 


GCAAAAATAT 


v-AA l l"l~i"rAG 


ATTTATCAAA 


GTATCAAGGG 


ATTGATAAAA 


1200 


ATAGAATTAA 


ACGTATGTTA 


VjTAQjA T AGAC 


CAACGTTATT 


Tk fk Ik ^^ik^n^ 

GAAACATACG 


GATGATTTCT 


1260 


TAA7VAGCTGC 


TAAAGATAAG 


CACG I i AACG 


AAGTTTATTT 


AATTTCACAT 


GCATTATTAG 


1320 


AAACTGGCGC 


AGTTAAAAGT 


VaAATTAGCTA 


ATGGAGTCGA 


AATTGATGGC 


AAAAAGTACT 


1380 


ACAATTTCTA 


TGGAGTAGGA 


GCCCTTGATA 


AAGACCCAAT 


TAAAACAGGT 


GCAGAATATG 


1440 


CTAAAAAGCA 


TGGTTGGGAT 


ACAt,C 1 GAAA 


AAGCTATTTC 


AGGCGGTGCT 


GATTTCATTC 


1500 


ATAAGCACTT 


CTTATCAAGC 


ACAGATCAAA 


ATACATTGTA 


TAGTATGAGA 


TGGAATCCAA 


1560 


AAAATCCAGG 


AGAACATCAA 


TATGCTACAG 


ATATTAAGTG 


GGCAGAAAGT 


AATGCAACAA 


1620 


TTATCGCTGA 


CTTTTATAAG 


AACATGAAGA 


CTGAAGGAAA 


ATACTTCAAA 


TACTTTGTGT 


1680 


ATAAAGATGA 


CAGTAAACAT 


TTGAATAAGT 


AATTTGATAA 


GCTACGAGTT 


GI'l'lTiATGA 


1740 


CTCGGACATA 


CTAAAAAGAC 


GCTTTCTATC 


TTGTTTTGAT 


AGAAAGCGTC 


■i"i-i"l"iGCATT 


1800 


AGAGAAAACA 


CATTGATkGA 


TAAtCCCaCC 


aATGCAAgTG 


GGG c AGGACa 


TCGATAAAGA 


1860 


ATTACTTTTT 


CTTTAGAAAT 


TAGTATTTCT 


TATGCATGAG 


TTTTACTCAT 


GTATTCCTAT 


1920 


TTTTAAGTAC 


ACATTAGTTA 


TAGCTAATGA 


TAAAGAACCA 


CTACATAATA 


AATCATTAGT 


1980 


GTTTTTTTAT 


CATTTCTGTC 


CCaCTCTCAT 


CGTGATTTGA 


AATTTTCAAT 


TGCGATTTTA 


. 2040 


ATTTCATCTC 


TTACACGTTG 


GAACTCTGAC 


CAAGGCTTGC 


CTGCAGGATC 


ATCAAATCCC 


2100 


CAATGTTCTT 


TCTTAACATT 


TGTTGGTAAA 


GAAGGGCAAT 


TTACGTCTGC 


ATCACTACAT 


2160 


AATGTAACAA 


CTAAATTTGA 


ATTTnTAATA 


ATATTATTAT 


CGGATTAAAA 


TCTGATGGAT 


2220 


GATTTGATAT 


ATCAATGCCT 


ACTT 








2244 



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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CCAATATAAA TTTTTGACGT TGATAAGTAG GATTTACTTT AAAAATTAAG TTTCTACGCT 1500 

TAGTACTCGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCT 1560 

^ GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATCTTGA GGTAACGTAA 1620 

ATACTTTTGA AATAACATTT TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 1680 

AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACG CTGAAGCATA TTTAAAAAAT 1740 

ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 1800 

CTTTTTCAAT ACGTTGTTGC TCTTTTTGAG TCAAACTACT ATTTAAAAAA GCAGCTTGAA 1860 

TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TGATATTAGT GGACTTATTA 1920 

IS 

CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCmTATAGAC yTACCTCCAC 1980 

CAGTtGGkAA GrCACCAAGC ACAT 2004 
20 (2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 
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35 



40 
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AAAGATTGCT 


TGCCTTGAGG 


GTTTATATAT 


CTGACTCAAT 


TGCCACATTT 


TTATCAAGAG 


60 


TAGTTGATAA 


TACTCATCAT 


AATTATAGCT 


AATATTATAT 


•ri'TTTTAAAA 


GATAGTGTAT 


120 


GATTTTCTGG 


TGTTTGTTGT 


ATACGTCATT 


AAATTTCAAG 


TAGTCATTCT 


CCAAGTTATA 


180 


CGTATAACAA 


ATATTTCCGG 


ATAAAGTTAG 


AATAAAATAT 


TTAGAAAAAT 


CATTCATTTG 


240 


CGT^ATCGCT 


AAATTAAGTG 


TTAAATATAA 


GACATAAGTA 


ATTAATTTAA 


TGTGATATGA 


300 


TGTATTATTA 


CTTTGCTAAA 


TAGTAGATAG 


AACAAAATTT 


GTAATCGGGA 


GGTAACAATG 


360 


GATTACGCAC 


ATTTAAATTT 


AGAACATTTT 


TTTGCACGAA 


ACGACGATTT 


AGATGTTATA 


420 


AGAGATCGCG 


CTGATTTCGT 


GATGATAAAT 


AACTTCACTA 


ATGAAATGAT 


GTATCGTGAT 


480 


GGTCAAATTG 


AAGGCACGAT 


TGATTTAAAT 


CAGTACTATT 


ATAAAAATAG 


ATCAAATGCA 


54 0 


GCAAGTTTTA 


TTATGATGGA 


TTATAAAAAA 


GAAACTAAGT 


AAACGAACAA 


AAGAATTTTT 


600 


TGTTTTTTAA 


TACGTGAATA 


ATAAGATTAT 


TGATATAAAG 


GTTTTCAAAG 


GTTATACAAA 


660 


AAGATAAAAC 


ATTTATGATT 


CGTAGATCAA 


CGTAAAGTAA 


TGTTGATAAA 


TGGTTTAAAA 


720 


CGTTTCATTT 


ACATTACTGT 


TTATTTATGA 


ATATGTAACA 


ATGCATAGAT 


AAAATTGTTA 


780 



-xiSOCX^tD: <EP 0786519A2. 1,> 



1081 



EP0 786 519 A2 



i 



10 



20 



25 



30 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1080 
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60 
120 
180 
240 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 
ACAAAAAATT CACCCTCATT AATATTGAAA CTAATGTTAT CGACAGCAAC ATGTTTGCCA 
,5 TAACGCTPAG TTACATTTTT AAACTTAATC ACTTTGCCAC CTCTTTTTTT CTCATAGCAT 

AAAACCGAGA TTATATGTAT GTATTCCCTA TTTAACCACG TTTATTACAA TTTTCAAATT 
TAAATGATTT ATCCTTGAAC TTTTTTAATA AAATAATGAA TAAwAGGwAA TCwCCAGTTA 
AGAAATAGTG TTATTTTACC TTGAATTCAA AAAAACACrr art-ran a anna nni^^rnr^r^^ 

vjO.™-kx>j»„ i Xrt -JU O 

CTAGGTGTCT TCACTATACT TTGGCTTTAT AATTTTGAAT CGTTtCTAAA AATGCTGGAC 360 
AATAATGTTT TAATTTGTAA CTACCTACGC CATCmATATT AATCATATCT TGTTTCGAAG 
CAGGCTTACG TTTAGCAAAT TCCTCCAACG TGTAATCAGA AAATATACTT ACAGGTGCTA 
TCGTTAATTT GTCACTTAAC TTTTTACGAA CTTCTACCAA CTGACTGAAT AATACTCGGT 
CAACCCCTTC AACCGTATTT ATAAATACTT TTTCAGTCGC TTTTTGCTTA AATGGTGTTG 
TGAATACTTC TACTTCATTA CTGAGTAATT TTTTAATTGA AGTATCACAC ATTAATATTT 
CGTCATTTTC ATTTAAGAAC CCTTTGAATC TTAATTCATC TATTAAGTGA CTTAATTCTG 
ATGTTGTGTA ACCTTTCATT AAACCATGGG TTGAAATTTG GTCATAACCT TTATACTTAA 
TATAATCTGk TGACFCTCCT CTTAACACTT GAATGATAAC ACTATAACTC TCTIGITGTT 
TCATACGAGC GATGCAACTA ATAATCATCT TAGCTTCTTG TGTCATATTA TATGATTTAT 
40 CTTGTTGAAC ACAATTACTA CATTGTTCAC ATTCTTCTAA TTTTTCATTC GGTTCAAAAT 

AATGGACAAT TGTTGCTTCT AGACATTTTT TTGTTTTTGT ATATTGAATC ATrTTAGTTA 
ACTTTTCGCC CATTTTATCT TTATAGTCAT CATCAGCTTG AGAGACTGTT ATAAAATACT 1080 
CGTGTAAATT GATAt CGCGT TCGCTAAATA ACAAAATACA TTCACTTTTC AACCCGTCAC 114 0 

GACCTGCACG ACCCGCTTCT TGATAATAAG ATTCTAAATC TCCAGGCATA TTATAATGAA 1200 
TAACAAAGCG TACATTGGAT TTATCAATAC CCATACCAAA AGCATTTGTA GCAACGACTA 1260 
CTTTAACACG ATCAAATAAG AAATCATTCT GCGCTTCTTC TCTTTCTTTA TTGCTCAAAC 1320 
CTGCATGATA TATAACACTT TCAATTTTCT GACTTTCTAA GGCTTCTTGA AGCTCTTCAA 



420 
480 
540 
600 
660 
720 
760 
84 0 
900 
960 
1020 



1380 



10 



IS 
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TCTGATTTCC TCTGGACTCA TCTTAATCAT TGCCATAACT AGAAACCTCC TGAATATTTT 3000 

AAGTTTATCa AAACTTTTTA GGGACACTAT TTTTTGAAAA AGTGCTCCTT ACTCAAATAA 3060 

TATATAAATT ATTAGTATAT GTATATAGTt TTTTAAGTAT TTTTAGCTTT TTTAAAATAA 3120 

ATATATTGAA TATAACCATA TATTTTTAAT TAACCATTCA TTTTTGTAAT ATAAATGTGT 3180 

ATACTAAAAT TAAATTAAAT ACATAAAGGA TTAAATGGTT ATTATGAAGA AAACAATTTT 324 0 

ACTGACGATG ACAACTCTTA CTTTATTTAG TATGTCGCCT AACTCGGCTC AAGCATATAC 3300 

GAATGATAGC AAAACATTAG AAGAAGCAAA GAAAGCACAC CCAAACGCAC AGTTCAAAGT 3360 

GAATAAAGAC ACCGGCGCGT ATACTTATAC ATATGACAAA AACAACACGC CAAACAACAA 3420 

TCATCAAAAC CAGTCACGTA CAAACGACAA TCATCAACAC GCAAATCAAC GTGATCTTAA 3480 

CAACAATCAG TACCATTCTT CATTAAGTGG TCAGTATACG CACATTAATG ACGCAATTGA 354 0 

TTCACACACA CCGCCTCAAA CGTCACCAAG CAATCCTTTG ACACCAGCAA TACCGAATGT 3600 

CGAAGACAAT GACGATGAAT TAAATAACGC TTTTTCAAAA GATAACAAAG GGCTTATTAC 3660 

AGGCATCGAT TTAGACGAAT TGTATGACXSA ATTACAAATC GCCGAATTTA ATGACAAAGC 3720 

25 AAAGACCX3CT GACGGTAAAC CTTTAGCATT AGGTAACGGT AAAATCATTG ATCAGCCTCT 3780 

TATCACAAGT AAGAACAACT TATATACTGC TGGACAATGT ACATGGTATG TCTTTGATAA 3 84 0 

ACGTGCCAAA GATGGACACA CGATTAGTAC ATTTTGGGGA GATGCTAAAA ACTGGGCAGG 3900 

CCAAGCTTCA AGCAATGGCT TCAAAGTAGA TAGACACCCA ACACGAGGAT CAATTTTACA 3 960 

AACAGTAAAT GGTCCATTTG GTCATGTAGC CTACGTTGrA AAAGTTAATA TTGATGGAAG 4020 

TATTCTAATT TCAGAAATGA ACTGGATTGG TGAATATATC GTTTCATCAA GAACCATCTC 4080 

TGCTTCAGAA GTTTCATCAT ATAATTACAT CCATTAAATT AATCATGACA TCAATAAAAA 4140 

GCGACCAGTT CGCAGTTTAC AATTCGTAAC ACTGCAAAAT TGGTCGCTTT ATTTTGTATG 4200 

TTATT'CGATT ATAAAATTAC AAAGAAATGT TCTCTACATT CCCCATTAAT CAAAATCGTT 4260 

TAOGAAAGTA TAATTGTAGC TATAATAATC CAAGTCGTAA CAACTAGTGG CACTATCX3TC 4320 

TTGAATAAGA ATATACCGTA TTTTTTCTTG CGATATATAT CCAGTACTAG CCAAATTAAA 4380 

4B ATGATTATAA CJKCCAACMA AATAAATACA GGATTCATCG ATATAGCATC TGCCTGTAAC 4440 

TCAGGTTGCA TTCTTAATTT AGTGATAATT AACATCACTA CTGAAATAAT GAAAAAGTAG 4 500 

ATACCTCTTA TCTTTGATGT CTGTAAATCT AATTCCTGCT CTTCAATGAC CTCTTTAGAT 4 560 

^° TCACCCAATT CTTTTGCAAT CAAATAATTT ACTACCTTAG GTTTCACCCA TAAACACTTA 4620 
ATTCCAAAGT ACATAAAAAT ATATGATCCA GTATCCATAA ACATTAAAAA GTTGCTTAAA 

55 
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TAAAATGAGG AtCTGTTGCA ACAGTTAATT GATTAATATC ATTACrTTTA ATCGTrTCAG 
TACGTTGCAC TTTGACACCA TCATTAATCA AACTACTTGT GTCTTGCGCA ACTTGGTTAT 
* CGTAATCTGT TAAATTAATG TGTTCTGCTA ACGGTTmT CAAATTATAT TCATmTAT 

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA TTTAAACCTG 
CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 
TGACAATATC ATCATGAAGT TGTTTCTCTA AGTTTTCAGC GAATTGAGCA TTGTTCATTT 
TAATGCTATT TAGCGCATCT TGTAAGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGATT 
CTGTTAATTG CTTACGATAn TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGTCATT 
GAATGATTTC TCTTAAITTA TCTAAGTTTT CTTTAACAGT TTGTTTATAT TCTTCnTAC 
CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTT AATGCATTGA 
20 TATAGTTATC AAGTTCCACG CTATCTTTTT GCGATTTATA ATCTTGTAAC ATTTTATCaA 
TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTGTTTTTC AATTAAAGTC GAAACATTAT 
AATCTGTGTT CACTCTGAAC GTATCTGAAT TCGCACTCAA TAATGAITTA TTCTATGTTT 
GGAACCATTT TGTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 
ATTCCGGGAA GTCGTTAATT GGATTTAATA AGTAATTCGA GAATTTACTA TTCACACCAT 
GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 
TTGTTAAATA AATTTCGACT AAGTTTTTGT TAAAGTCGTT AAGTACATTA CTTACAACTT 
TTTCTGTGTT TTTAGCTACT TCTTCTTTTT GTCCTACAGC -TCTTTTATAC TGTAGCGATA 
TTTTCGATGG TGTTTTAGCG TCTAATTGCA TTGCCAATTT TCAAAAGTTT TCTGGGATAA 
CAATCATGAC TTGGTATCCA CCATTTTTCA AACCAGACTC AGCAACGTIT CTTGTTACTG 
TTTCAAATTT ATAGTTTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 
CTTTTTTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TGTATTTTAT 
TATTTTTATT ACTTACACTT TGGGATCCTT •TTTCTGATTG ATCTCCATAT TTrGTTTGAA 
CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA AGTGACAATT AATGCATAAA 
TCCAATTTTT CnTTTCATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 
AAAGCCACAT TAGAAAATGT GACTTTGCCA ATTTCAGAAT GCTTATTGCA AACCGAAATT 
ATTAGAAAGT TGTTGGTCTT GTTCTTGAAC GGCATCAGCA GTGCtATTcA ATTGTTGTTT 
AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTr GGAATTGCTC 
TTCGAAACGG CTGAAAGCTT GACCTTCCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 
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10 



IS 



PJKGTTGCTTPiA ATTAGCCAAA GCACATCAAG TTGAGCGTGT CATTTGCAGA CTTGAAAGCA 1200 

CAACGGACGA TACAGAGTTA GTTGATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260 

ATAAAATCTT ATTAAAAGGT TTAATTGAAA CACCTAACAT GTTGAATTTA TTAAGTAATG 1320 

TTGAAACGTC ACTATATGAA ATTCAAATGT TAAATTATAA ATATGAAAAT ATTCAATTAC 1380 

GTAATTTCCC ATTCGGAGGA GACATCATC 1^09 
(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

AAAGAGGGTT TTTTACTACT ATAATCCATC CTTTAATGGA ATTCCATTTG TCCCTTTGGC 60 

CCGTTCATAA CCATAGGAAA ATATATGCnG nAATCATTGA TAGTAAATAC ATCAATAATA 120 

CGAATAATGT CGGTGAAATC ACGTCCGTAT TACCATTATT AGCTAAAACA TTTTCCAAGT 180 

TTTCTTTTGA ACCGGATACT CATAAATGCT TTTAATGCnT GGTTTTGTCT GTCGCCATCT 24 0 

TTAGAATTTG TAATACTTTT TTAAATTCTT TAGCGAACAA CTCATTATCT TTATCGTTTT 3 00 

TAGCCATTTG ACGATTCAAA TCGTTAGCTC TTACGCCTAA TGCTCGACCT GTCGCATGTA 360 

GCTTATTCAC ATTATTGTCG ACTTGATTTA ATTGGCCACT AACAGAATCT GCAATTGATT 420 

TTGATTCTTG TGTATCTGAT AGCAATTGCG TACTCTTTTC AGAAATTCTA CTAATTTCTT 4 80 

TATCTAAATT TGAAGACATC GTATTAAATT CATCATTTTT GCCTTTATCA ATTTTTGGTT 540 

CTTGTGGCTC TTCAGCAAAA GTCTTTTTAA CGTTTTCTAA CTGATCAATC AGCTTGGAAA 600 

TATCTTCT T k ATTTTTTGTT gTATTCTTTT TGTTATTTAA AATGTCATCA ATCAGTTTGT 660 

CTGAGTTTTC TTCCATTGAA TCAATTTGaT GTAACACAGC TACTTTATCG TCTTTGAAAC 720 

TTTCCATGTC ATTGATAACT TGGTCAACCA TCATATCAAT TAAACGTTTG TTGTCGAATG 780 

GTTTATCTTC tCTGCCTTTT GTATCTGTGT ACATTTTATA ATGCGCATCA AACCTAGATA 84 0 

ATGCACTCAA TTGCTGGCTT AATGCATCTT TCGATAAGCG ACCATCaAGG TTATGATTCA 900 

ATGTTACATC CACAACACTC GTTGCTTTCT TATCATTTGG TTCATCTTGA CGATTTGCTT 960 

GTCCAAATAA CAATTGTAAA TGCATTGTTT TATCTTTTAA GAAATCTTTC TCAGCATCCT 1020 

TTTTCAATTT AGCAACGCCA TTGACTTCAA CTTTATATTC CTTGTTAGAT GTATCGAGTT 1080 
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CAGCATTGAT GCACGTGTTA GCCAATTGAA ATCTCAAATT GAAGAAACTG AATCTGACTT 
TGATCGTGAA AAATTACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 

5 

AgTAgGTGCA GCAAGTGAAA CAGAGCTTAA AGAACGTAAA TTACGTATTG AAGATGCATT 

AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 

AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT GACATTCAAA CAGGTGT ioi7 

(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1409 base pairs 
IS <B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
TTAATCCAGC GTTAACTGTA TTTGCATTTA TTATGATTAT TTCGATTCTT TTAGCGTATG 
TATTTAAATG GCTTGGATTA GTGGATGATG TGTTATTAAT GGTCATTATC ATTTCAACTA 
TTTCCTTAGG CGTAGTTGTT CCAACTTTAA AAGAAATGAA TATTATGAGA ACAACTATAG 
GGCAATTTAT CCTATTAGTA GCAGTACTTG CGGACTTAGT AACTATGATT TTATTAACGG 
TCTATGGCGC AATCAATGGT CAAGGCGGCA GTACAATATG GTTAATAGGT ATATTAGTTG 
TTTTCACAGC AATTTCATAT ATTTTAGGTG TTCAATTTAA AAGAATGTCA TTTTTACAAA 
AATTGATGGA TGGTACGACG CAAATCGGTA TTCGTGCGGT ATTTGCATTA ATAATATTAT 
TAGTAGCCCT AGCAGAGGGA GTTGGCGCAG AAAATATATT AGGTGCATTC TTAGCAGGTG 
TCGTTGTTTC ATTATTAAAT CCAGATGAAG AAATGGTTGA AAAGTTAGAC TCATTTGGTT 
ATGGGTTCTT TATTCCTATT TTCTTTATAA TGGnTGGTGT AGATTTAAAC ATACCTTCAT 
40 TAATTAAAGA ACCGAAATTA CTAATTATCA TACCGATTTT AATCGTnGCA TTTATCATTT 

CAAAATTAAT TCCAGTCATG TTTATTCGAC GTTGGTTTGA TATGAAAACA ACGATTGCAT 
CAGCATTTTT ATTAACATCA ACATTATCGC TCGTGATAGC TGCAGCCAAA ATTTCAGAAA 
GATTAAATGC TATTTCAGCT GAAACGTCAG GTATATTAAT TTTAAGCGCA GTCATTACAT 
GTGTATTCGT TCCGATTATT TTCAAAAAAC TGTTTCCAGT TCCAGATGAG TTTAACCGTA 
AAATTGAAGT TAGTTTAATT GGTAAAAATC AATTAACGAT TCCTATAGCG CAAAATTTAA 
CATCTCAGTT ATATGACGTG ACATTATATT ATCGCAAAGA CTTGAGTGAT CGTCGTCAAT 
TGTCAGATGA TATCACGATG ATAGAAATTG CTGATTATGA ACAAGATGTT TTAGAACGAC 



45 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
TTTCCAAGAA GGcTTgAAAA AtGTTwCaAG TGGTGCGAmC CCAGTTGGTT TACGACAAGG 
TATCGACAAA GCAGTTAAAG TTGCTCTTGA AGCGTTACAT GAAAATTCTC AAAAAGTTGA 
AAATOAAAAT GAAATTGCGC AAGTAGGTGC GATTTCAGCA GCAGATGAAG AAATTGGACG 

TTATArrrCT gaagctatgg aaaaagtagg taacgatggt gtcattacaa ttgaagaatc 

AAATCGACTA AACACTGAAC TAGAAGTGGT TGAAGGTATG CAArTTGATC GTGGTTATCA 

atcaccgtat atggttactg attcagataa aatggttgct gaattagaac gcccatacat 

TTTAGTAACA GATAAGAAAA TCTCGTCTTT CCAAGATATC TTACCTTTAT TAGAACAAGT 
GGTTCAATCT AATCGTCCAA TCTTAATTGT AGCTGATGAA GTTGAAGGCG ATGCATTAAC 
AAATATCGTG CTAAACCGTA TGCGTGGCAC ATTTACAGCT GTTGCAGTAA AAGCACCTGG 
TTTTCGTGAT CGTAGAAAAG CGATGCTTGA AGATTTAGCT ATTTTAACTG GTGCGCAAGT 
GATTACTGAT GATTTAGGCT TAGATTTAAA AGATGCATCA ATTGATATGT TAGGTACTGC 



2940 
3000 
3060 
3120 
3180 



TCACCACGTG TAGATGTTGT TGTAGAAATT AGTAAACGTA TTAAAGACGC ATTTCTTAAT 
GAAGATATAG ACATACTACA CCAGCAATCA AGACAACAAT TTGAAGGGCA TTTTGTTGTA 

^ TGCACAGTGC ATCAACTTTA CCGATTCAAA CAGCACTITG ATACTATTTT TATTGATGAA 

GTCGATGCCT TTCCrTTATC AATGGATAAA AATTTACAAC AAGCATTGAA GTCATCTTCT 
AAAGTTGAAC ATGCAACAAT TTATATGACA GCAACACCAC CGAAACAACT TCTGTCAGAG 
ATTCCCCACG AAAATATAAT TAAATTGCCA GCTCGCTTTC ATAAAAAATC ACTTCCAGTT 3 24 0 

CCTAAATATC GlTATrrCAA ACTTAATAAT AAGAAGATTC AGAAAATGTT ATACCGAATT 3300 
TTACAAGATC AAATTAATAA TCAACGTTAT ACACTGGTGT TTTTTAACAA TATAGAAACA 3360 
ATCATTAAAA CATTTTCGGT TTATAAGCAG AAAATTACTA AATTAACATA CGTCCATAGC 
GAGGATCTTT TTCGCTTTGA AAAAGTTGAA CAATTAAGGA ATGGACATTT CGATGTCATT 

20 TTTACTACGA CAATATTAGA ACGTGGATTT ACAATGGCAA ATTTGGATGT TGTTGTTATC 

GATGCACATC AATATACTCA AGAGGCTTTA ATACAAATTG CTGGACGTG 
(2) INFORMATION FOR SEQ ID NO: 266: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



3420 
3480 
3540 
3589 



€0 
X20 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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ATTTACCGTC 
GCTTTGGAGT 
TAAACGCTTC 
TTTGGTTTCT 
CATAATTCAG 
AAATGACGTC 
TACCACCAAA 
TGTCTAACAT 
TATTCATTtC 
TTTTTATTGC 
CAATGAATCT 
TATTTTGTGT 
CACTGAAAAT 
GTATCATTGC 
TCTACAAGTT 
AGTGTGACTT 
GCCATTCAAA 
TTATTTGTCA 
GATGATTGTC 
CATTTAGGTT 
TTATAAAATT 
CTTTAAATGT 
TTAGGCTCTT 
CATCCTCATA 
CAAAATTCAA 
TATGATTCCT 
CGAATGGATA 
TATTATCATC 
CGAGCCATTA 



TTATGATAGT 
TTCTTCAGTC 
ATAGTCACTT 
TAACATAAAA 
TGTTACTATA 
TCTAACAGCA 
ATATCTTGTT 
TGGGa CACCG 
AGGTCCAATA 
AGCAATAAAT 
TGATTTACTA 
CATAATAACT 
GACATGACTA 
TATGAGTATA 
ATCTGTCGCA 
TTGAAGATGG 
CATTAGAAAA 
TAAATGGAGA 
CTTCAGCTTA 
CTGGTGGATT 
TTAATAAAAG 
TCATCATTTG 
TTTTACGTTG 
CTTCACTTAC 
ACTATTATTG 
CAGTACATTC 
ATGTAACAAG 
TCTCATTTGA 
GAAAGAGACA 



GCTTTTTATT 
AAATCGAAAT 
TTAACGACAT 
TGAGTTGAAG 
CATGGTACTG 
CCGCTATAGG 
ACTACGACAC 
GcAGTTCCtG 
GTATATGCAG 
GCTTTaGCTT 
ATCACATTTT 
CCTTAATTCA 
TAAATCGTTT 
TTTTAGGAGG 
GGACTTAATC 
CAAGATTATA 
GAAAGTATTA 
TCATTTCGAA 
TCAAGTAGCA 
AGGTTTAGGC 
AGTCTATATT 
AAAGCCAAAA 
AAATGAGGTT 
TAATGAAAAA 
TGTTCAATGT 
CAAGAAAATT 
ATATAAAATA 
ATTGTCGGAA 
AACGATTTTG 



TTTATTCAGT 
TTCCTGCAGT 
CGATATAGTA 
CTAATTCATA 
CTTCTCGTAG 
CGCGAATTAA 
ACGCATTATG 
TCGGTTCACC 
AACAATTATG 
CATCTTCATT 
CAATAATGTG 
TAAGCTTAAG 
GATTGCCATT 
ACGACTATGA 
GATAAATATA 
CCAGAAGAAA 
GATATTGTAA 
GATGGTCAAG 
TACTCTGAGT 
TATGTTGGCA 
GTAATTGGAA 
TGCTAAAGAT 
TTAAGCATTA 
ATTAAAAAAG 
GAAAGTACAA 
GTATATTGCA 
ACAGAGAGTT 
CAGCAGTCTT 
TTATATGCCG 



TGGTATATCG 
CATTTGATTT 
GCTTACCTTA 
TTCAAATTTA 
TTCGACACGC 
ACCGCCACCA 
aACATCGrGC 
ATCATCATTC 
AGTGGCATCT 
TTGAACAGGT 
TTCTTTTTTA 
ATTATTTAAT 
TTCTTTTTAA 
AAATTGCTGT 
ATATTcAAAT 
AAGTTCGTAC 
AAGACTTTGA 
CX3TTATACAA 
TTGGTCCAGT 
GAAAAATAAG 
ATTATCTCTC 
ATAAGAAAAT 
AACATTACGG 
AAGTAACAGG 
ATCCAAAGCA 
GAAATTGTAT 
CGCAAAGTTC 
ATGCCTCAGA 
TAACAGGTGC 



AAAGGTAACT 
AAAAAGTTAA 
TCAGTGTAAG 
CCAGTTTGAT 
CCGATATCAT 
CCTAATTTAA 
TTTTTTaATA 
GCnTTTTGAA 
TTATGTTCTT 
TTGATATGAG 
ACAGTAATGA 
CTTCATTATA 
CTGAAATATT 
GATGACCGAT 
AGCGCCATTA 
TAAAAAGCGT 
AGAAGTAACT 
AAAGTTACAA 
TGTTGCAGCA 
ATTAACATAA 
GTATACATGG 
CATTATAATA 
GAAATTAATT 
TGTCATCAAA 
TTTTTATCAG 
ATCACTGGGT 
ATCACAAGCA 
ACATATTGTT 
AGGTAAGACA 



1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

174 G 

1800 

1860 

1920- 

1980. 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 
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TAAGTTCAAA AAAGAATTCA AACCTGTTAT GCyVCTTAAAA GGTGATGCAT TCAATCAACA 
GTTACAATCT TTGATTAACA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 
TCCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATGCTT GGAACCTTCA 
AAAATCTATA AATGACATTA TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTTATAA 5100 
AAGATGATGT TTCCCCGTCA ATCGTAGATG GAAATGGCCG TTTAAAATCG GGATACTAAT 
GTATTTCCAT C 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



5160 
5171 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 
CTACACACTA AACCTATTTC AGTTATGGGT GGTACAGTGA TTCTCTTTTC ATTTTTAATA 

GGTATTTGGA ITGGTCATCC TATTGAAACA GAAATCAAAC CACTTATTAT TGGTGCGATT 120 

ATTATGTACG TACTTGGGCT TGTAGATGAT ATCTACGATT TGAAACCGTA TATAAAATTG 180 

GCTGGTCAAA TTGCCGCTGC CTTAGTAGTT GCTTTTTATG GTGTGACTAT TGATTTTATT 240 

TCGTTGCCAA TGGGTACAAC GATTCATTTT GGATTTCTTA GTATTCCAAT TACTGTGATT 300 

TGGATTGTTG CTATTACAAA TGCAATTAAC TTAATTGATG GACTCGATGG TTTGGCGTCG 36C 

GGTGrrrCkG CAATCGGACT CATTACAATA GGGTTCATTG CAATTTTACA AGCTAATATT 42C 

TTCATAACGA TGATTTCTIG TGTTTTATTA GGCTCTTTAA TTGGGTTTTT ATTTTACAAT 48( 

TTCCATCCTG CCAAAATATT TTTAGGTGAT AGTGGGGCTT TAATGATTGG ATTTATCATC 54< 

GGA-^crrr ctttactcgg attcaaaaat attacaatta rrGCATTGrr cttcccaatt 60. 

GTTATCTTAG CAGrrCCATT cattgatact ttgttcgcaa TGATTCGACG TGTGAAAAAA 66^ 

GGGCAGCATA TAATGCAAGC TGATAAATCG CATTTGCATC ATAAACTATT AGCTTTAGGC 72 

TACACACATA GACAAACAGT ATTATTAATC TATTCAATCT CTATTTTATT TAGTCTTTCG 78 

AGCATTATTT TGTATCTATC GCCACCATTA GGTGTTGTAT TAATGTTTGT ATTAATCATA 64 

TITAGTATTG AATTAATTGT TGAATTTACA GGATTAATAG ATAACAACTA CCGACCAATA 90 

TTAAATTTAA TTAGTCGTAA GTCATCTCAT AAAGAGGAAT AGGGAATGAA AGCATAGCTG 9fi 

TATGGGATAA TTTCTATTAT ATGGCTTTAC TCTTTACAAT ttttttgtat taaatttcaa lo: 
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GTTTTCAGTC CGTATCCCGT CATTTTTTTA CTAATGCTTT GATTATCTTG ACTCACTAAA 
CTATGACTAA TAAATGGTAG CCCCATAATT TTGAACATTT CTATTTCTTT AATTCCGGAT 
TTTACCCAAT TTTTTCCAGA TTTATAAAGT CTTACTCTTG TTTTTTCGTT TGCTAAGCTG 
TCATGAAATG CTTTCTGTCT TTTACTCATG TAATAACTCC TTGTATTATC TTTACATTCA 



3120 
3180 
3240 
3300 

TTAGATTATA ATATATGCCA CTATTCAATT TAATACAACT CTTTTTTGAT ACAAAAATAC 33 60 



TCATnTGTT AAAATTTGTA AAAATTCaTT TTTATTCGTC TAAATGTAAT CGTTTTCATA 34 2< 

i-i-i-i-i AAAAT TACTTTTTCT CGTTTATGCG TATAATCTTT TTTTATATAA ATTTGGCTAA 34 8 ( 

TTGGCTTTAT GTTTAATCAT TATAATTGTT TCGTTTTTAA AATAATTATT GTATTAATAT 354C 

ATCTATACCA TCCACCTTTT ATTTTATAAAT AGTTAATTTA CAACTAAACG ATAAATATTA 3 60C 

TATGCAAAAT ACATCTTTAA TATTAAAGTA ATACCAATAT TTTTTCAATA AACCTAGTGT 366C 
AATATATGTG TAATTCTAAA 

— — — — * w-i\_%j rw-rtX-M J. i. X J/iiCi 

TTAACATTTT CATTTATAAA GCGAAAAAAT GCATCGCTAC TAAGTTGAAT GTTTAGTAAG 378 0 

GATGCATTGA ATTCACTAAA ATGATTAAAT TACTTATATC TTTTCATCTG ATTGATTATC 3 84 0 

GAAATTTCTT CCTTCTAAAC CTGCTAACTC TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3 90 0 

AAATATCTCA TTCACTACTG TGTAATCGTA ATATCCTAAT CTGGCAATAG GTTTAATCGA 3 960 

CTTAATGTCC AATTTACCAT TATCAAGAAT AACCTTATCG TCAATATGAA CTrGGGCAAC 4020 

TCTTCCTATA ACAATATCTA CGGTAGATAC TGGATCTCCA GTTGGAATAC GAATCGTTTG 4 080 

AACGTACTCA CATTCAAAAT GAACTGGCGA TTCTTTTACA CGATATCCTG GAGCTTCTAT 414 0 

35 ACATTTTTCC TTTGTTACAC CTGCAAAATT AAATTCATCC TCTTCTGGTG GCAATGCTTT 4200 

CGATGATAAA TTAACTGCTT CTCTTAAATC ATACGTTGCC ATATTCCACA CAAACCAACC 4260 

TGTCTCTTCA GCATTTTTCA CTGTATCTTT ACGTTCGTGA TCACCAAGAA CGGATTGATT 4 32 0 
TGCTGCGAAC ATAACCATAG GCGGATCCCA AGTTAAGTTT TCATACTGAC TATAAGGCGC 
TAAATTATGT TTCCCATCTT TCGATACAGT AGAGATCCAC CCTATTGGAC GTGGTACTGT 

ACTACTTTTA AATGGGTCGT GCGGTAAACC ATGACTTCTT ACACCTTGTT TTGGCGAATA 4 500 

ATTCATACTA TCTTCACCCC TTATAAGTAA TTACATTTAA GGTTACGCCC TCTTTACATA 4 56 0 

AGCGTCTAAT ATAAATAAAC AATTTATTTA TAAGTAGAAA CTATATATGA CGTGGTTGCT 4 62 0 

TATAATTTGC GTTCTTGATT CGAAAAATTC AGATAAGGAT TTATACAATT TUITATTTATG 468 0 

ATATCTTTTG TAAATTTAAT TAATTATAGT TACTTCAATC ATGATTAGTT TATAATAATA 4 74 0 
AAGTGAAATT GAAAAAGACA GCTATTATGC GATGAGCGAA AAACTTCAAG TAAAACAAGA 
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ATTGGTGTCA CTGTTGGTGC TGTCGTATCC ACAACATTTA TTGTAAAAGT TGTCGTCGAT 1320 

TTGTTATTTG CTTGGTCAGT AGACACAACT GTCACTGTTG ATTGACCAAT TTTTGTTGGT 1380 

GTCCCAATGA TTGAATTCGT TGCACTATCG TAACTTAATC CGCTTGGTAA TCCTGTAACT 144 0 

GTATTTGTCA CAGTCCCAGT ACCATTATCC GTTGTAGTCA ATACAATAGG ATTCATTGTT 1500 

TTACCCACTT CTATGGTTTG ATTGCCTACA GTTACAGTTG GTGCTTTTAC ATCAGTAAAA 1560 

TAATATGTCA CTGATTGTCC AGCATTCGTC ATTTTTACAG TTTTATTTGT ATCATTATAA 1620 

GTTGACGCAT ATGAACTATC GACGGACGTG TAGTTATATC CTTTAGCAGT CAATGCAGAT 1680 

TGCTGaTTAT CGATTGTCAC GACTTGATCA ACATTTCCTG AATATGTTTT TGGTGGAATA 1740 

ATATCTTTAC CTGTTGTTAC ATCAACGTAT CTCACTTGTG TAnCAGCAGA CTCTGTATAT 1800 

TCGAATGTTC CAAATTGTAC TTGTTGTAAA TTTGTCGCGC CACCTGTTGA GGCTGTCATT 1860 

2^ GATAATGAAA AGTTGGTCGT ACCACTTTTC GCAATCCAAT CTGAAATATT ACGTGTCCAT 1920 

GTTTGACCTG CATATTTGAC AGTCATAACC TTTGTATCAC CATTATAGTT AATATCAAAA 1980 

TCTTGGaACX5 TGTTATTTGT AGGTTGaACA TTTAACTTCG CAGCATTATC AGCTGTTGAA 2040 

CTTGATGTAT ACGTTGTCGC AACACCATAA CTATCTGTTG TTACAAATGC ACCAAACGCA 2100 

CCTCCACCAG CTACATTAGA TGGGTCAGCA TTCX3CCTTTG CAGCTGAATT TGGTTTAGAT 2160 

GTATTGTGAT ACGTATCCAA TTTGAAGCCA AATGCGTTAC TTAAGCCACC AATACCTACT 2220 

GCGGCACCGT TTAACCCTGT TTCACCTAAT ACACCTGGTG AAAAGGCAAA ACCGATACCA 2280 

TCTCCACCAT TTCCATGCCC TTCATATTTG TTACCTAAAT TTACTTTTCC AGAAAAATGA 234 0 

AAACTCTTAT TAGAGTCAAT ACGTGTTCCT AATGTAATAG CACCTTTTTG GCTGTATGCA 2400 

TCCTGTGrrA ACGTCACAAT ACCGGTACTT TGATCATAGG TAGCATTACC TGACGTTGTC 2460 

ATATATTGTT TTAAGTTATC TTTATTAACT GTAATTGTAT TAGCAGTTAC TGCGGTTGTC 2520 

^0 GTCgCTGctG ACGCAAATGT TGACATAGCT AAGCGACTGA AAGTTCGAAG TTTTACTGGT 2580 

GCGGTGCTAG TTGACGTTGT GCTAGTTTTG TTTAAGTTGA CCGAAGATGG CGTTGTGCTT 264 0 

TGTGAAGTGT TATTTGATGC AGTACTTTGA TTTGTTGATG TATTAATTGG TTGTTCTGTA 2700 

CTTGAAGTTG AAGCTACAGA TTTAGTATCA GAACTTGATG TAGTATTCTT TGAGGATGTT 2760 

GATTCTGATG TAGATGTCAA TTTCTCTTGT TGATTGCTTC TACTATTAGT TGTCGAAGTG 2820 

ACCTTTTCAG ACTTTTCACT TGAGACTGTG TCACTATTTG ATGTTTGTAC CGAACTACTA 2880 

TTTtTCGTTA CACTTGTGGA ATCGGCTGTT GATGTTGATG CTTCGATTGT CGTTGAGTTT 2940 

TGATTACCTA CTGTTTCACT TTGTGTGTTT AATTCAGAAG TTAATGGTGC ATCAGAAGCC 3000 
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TAATCTATAT ACAAGTGATG CACTCCATTA TTTTTAAATA ATGACAAACT CTCAGCTCTT 
AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 
AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 7075 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

AGACGTACTT TGTGATTmCG AAGyrCGTAC TGmAGCACTT GTCGACGTTG ATGTACTTGT 6C 

TGAACCTGAT TGAPTAriTAr' TT'vn'vniiTiyTi. Tr>% rrrrrn r^rrr. fr-o-^z-T..* ct,^,>-, 

■ — - *wrtw**«^j.x^ X v-nvjxvMX v_j\Vj liu 12 C 

TGAATCACTT AATGATTCTG ATGTACTACC TGACTGAGAC GTGCTCATTG AACTACTTAC 18C 

GGACATTGAT TTACTGTCTG ATGCAGATAA TGACCCACTT GTACTGATAG AGTCACTTAC 24 0 

TATCTCTGAA GTACTCATCG AGTCTGATGT ACTTGTTGAG ACACTTTGTG ATGCTGCTAT 300 

GCTTAGTGAT CCAGAAACAG AACCACTTGT GCTCGTCGAA TCGCTCAATG ATTCTGATGT 3 60 

ACTCATCGAT TTTGAATCAC TTGTACTTAA TGATATTGAT GTACTTTGTG AATCTGATTT 42 0 

GCTTGTTGAC GCACTTTGAG AGTTGGCTAT GCTATTTGAA ATACTGATAG AGTCCGAGGT 4 80 

GCTAGCTGAC TCGCTCAATG ATGTTGATGT ACTAATTGCA TTCGATGTAC TGTCACTTAA 54 0 

35 TGATGCTGAT GTACTAGACG ACCCTGATAT ACTCGTTGAT AAGCTTTGTG ACTTAGACAA 600 

GCTTCCTGAT GTACTCATAC TTAATGAGTC ACTGAGTGAT GTTGATGTAC GCAATGAATC 660 

AGATSTACTT GTTGATAGAC TTTCGGATTT TTCAGTACTG CTAGAGTTTG AAATAGAATC 720 

GCTTAATGAT GTTGATTTAC TAGCTGAATC CGACATGCTT GATGATACAC TTTGTGAATT 780 

CACTAAAETT GTGCTTGTTG AGCTTGATAC ACTATTACTT TCAGATGTGC TTAATGACTT 84 0 

AGATGCACTC ACAGAATCAG ATAGGCTTAC ACTTGTCGAT TTCGAGGTAC TAGCTGATGT 900 

45 

AGATACCACA ATCGATCCTG ATGTACTCGT TGATGCACTT TGTGAGTCAG CTTTACTTGT 96 0 

TGACACACTT TGAGATTGTT GTGTACTTCC TGATGTTGAT ACGGAATCAC TCATGCTATT 1020 

TCTTGTTACT TCATATTTAA AAGTTGTCGT CGTTTTGTTA CCGCTCGCAT CTGTAGAAAC 1080 

GATTGATATA GTACTTGTAC CAATGTTTGT TGGTGTACCA CTAATAGTAT TATTTGTACT 114 0 

ATCAAATGTT AGTCCGGATG GCAATCCAGT CACTGTATTC GTCACCGCAT TTCCACTGTT 1200 
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TGATATGTTG TATATCCCAC TCGTTTTAAA CCATAAGCAT CTTGCAATAC TTTTAGAGGT 5160 

ACAACAGATT GAATCGTAGA GCAGTTTGGA TTGGCAATGA TACCTCTTGT AAATGTAGGT 5220 

TCATTGACTT CCGGAACGAT TAAATCAATA TCTTCTGCCA TACGCCATTG ACTTGAATTG 5280 

TCTATAACGA TTGCACCAGC TTTTTCAAAA AGTGGGGCAA AGTGTTCGCT TGTACCGCCA 5340 

CCAGCACTCA TTAATACATA ATCGAAATGT TCACTTGCAC GAGCATCAGT TAATTCTTGA 5400 

ACTGTATATG TTTTTCCTTG AAATTCAACT TCTTGCCCTG CAGAACGTGC TGATGAAAAT 5460 

AATACTAATT CATCGAAAGG AATATTTTTA CGATTTAATG TCTCCAACAT TTTTGTACCT 552 0 

ACTAATCCTG TTCCACCCAC AACTGCTAAC TTTGTCATAA CTTGTCACTC CATTTTATAA 5580 

TAATTTCCaA TrTTTAGAAT ATTTTAACAA TCATTTTACC ATTAAATGTT AAATGCGTCA 5640 

TATAGTTTTT CTACCGCTTG TTGCCCATTA AAATCATCAA TGACGTATGA AATACTTATT 5700 

TCAGATGTTG TTGTTTGGTA GAAAGGTATA TTATTTTCAA TTAATGTCAA AAATGCTTTT 5760 

GATGCCACAC CTGACATATC ACGCATGCCT GAGCCAATTA ATGAAATTTT GACATAATGC 5820 

TCATTGATTT TATAAGCTAA TGCTTCATAT TGATTCTTTA ATGTTTCAAG AATCATAGAA 5880 

ATTTGATGAA AATCACTATC TTTAATCGTG AAGGATAGTT GTAGCCCATC CAAGTTGACG 5940 

ATTTGTGAAA TCATATCAAC ATTTACAGCA CCTTCTTCAA GTTCCGTAAA TAGTTGGGTA 6000 

AGTAGCTGAT TGTCAGGTAG GGGATAACTA ATTGTTACAT GCATCATATG TTTATCCAAA 6060 

GCCACACCAG TAACTGCTTT TTTCTCTAAT ATTTCTTCAT TTGACATAAT CCATGTTCCT 6120 

TTCACGTTCG ATAAAGTTTT TCCTAAATAT AAAGGGATAT TATAGTTTTT AGCTAATTCA 6180 

ACACTTCTTG TTTCAAGTAC ACCAGCACCT AAAGCGCTCA TTTCCATCAT TTCTTCATAT 6240 

GAGACGATGT CTAGTCGTTT AGCCTTTGGT AAAAGTCTTG GGTCAGTGGC ATACACACCA 6300 

TCAWX3TCGG TATAAATTTC ACAAGGTATT TGATTACTAA CAGCAAGTGC CACAGCGGTC 6360 

GTATCAGAAC CACCTCTGCC TAAAGTTGTT AATTCCTGAT GTTCATTGAT GCCTTGAAAT 6420 

CCAGCAACTA CTAAAATATC GTTTTCTTGA AAGGCTTGTT CAAATGTTTG AGGATTAATT 6480 
TGAGCAATTT TACnTITAA ATGATGGCCA ATGGTTrrAA TACCCGCTTG ATAGCCAGTC 6540 
ATTGCTTTGG CATTCATACC GATATCATTT AATACCATTG ATAAATAAGA TACAGTTTGT 6600 
TGCTCTCCGG TTGTCAATAA TAATGCCAGT TCrrGTTGTT TTGGTGCTTT AGTCAAGGTT 6660 
GATACATTCG TCA1TAATTG ATCTGTTGTG TTACCCATAG CACTTACAAC GACAATTAAa 6720 
rrGTrCATCT TGATTGACTC GCTCCTTTAA CATTTCAGCG ATCCTTTTTA TTTTTGTAAA 6780 
ATCACTGACG GATCATCCGC CAAATTTCAA CACACTTCTT GTTACCATAT AATCCTCCTA 684 0 
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TCGAAATCAT CAAGTAGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACGCCA 335 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 343 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTTCATCT 348, 

AATAAAGGGA AAAGCAGATT TGGATTTGAA AAATCTATTC CAACATCGGC ACCTTTAACA 354, 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 360( 

CCAACGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGXTGATr CATTCCGCCA 3 66( 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 372C 

IS GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 378C 

CAAATCCTAA ATAACTTGTr AGAGCTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 384C 

CTGATAACAG TGTGCCGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTCAG 390C 

CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAAOanar. 

' — - --^x-v-v-t^ A X j^you 

GACCGCCACG TTCATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATCCAAATG 4020 

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 4080 

TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AGTTTCTGGT TCAATTGTCA 4140 

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTCCAATCG 4 200 

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4260 

TAATTGCATC AGCCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4326 

CATTAGTGCC AGTTCCTGCT ATGACAGGAA CACGTTTATC TACAAGATCA ATAACTGTTT 43 80 

35 TTAGAATGCG TTCTTTTTCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 444 0 

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGTGTTTTC AAAGCTTCAA 4 500 

TATKWVCTTT GTTATTTGTA AAAGGGGTTG TAAGTGCaAC SCCAACACCC TCAAATAAAT 4560 

GTGTCATTTT AATTCGCTCC TTTTAAACGC ATAACTIXSTT CCAATACTTG TACAGCATTT 4620 

AATGCAGCAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTTCTAAT 4680 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATrGCC 4740 

ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4 800 

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4 860 

TGACTATCTT GAACAGGCAC ACGTGCGCAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4 920 



25 
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AAAATTTTTC TCGTCTCATC AATCATTTTT TGTTCTTCTT TTCTATATCC GTTTTCTAAA 
AACACATCAA TATGCGGTAA CACATTATTA TAAATTGGAT GTCGATATGC TTCTCGTCCT 
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ACATGACTTG 
TCATCTCTTA 
GCTTCAATTG 
GTTTCTTCTG 
ATGTCTTTGC 
GCATGCATCA 
AACGCATCAA 
ATGACGCCAG 
ATATAAGCAG 
CGATGTTTCG 
ATATAAATAG 
TAAGGTTGAG 
ATTCAGTTTT 
CTTAATCACT 
AATCX3CGCCA 
ATTTGCACCG 
CACACCTGCT 
ACCGAGAGTA 
TGTTGCGCCC 
ACCTGGTTCA 
ACGATCCATT 
CCAATCATCC 
TTCTGGATAT 
TTTAGCATCA 
ATCGATCTCC 
ACTAAGCGTT 
TGCGTGATTT 
GTACCTCCAC 
TGTCTATCAT 



ACAGGCCTGT TAAGAAAAAG CGATACTCTG TTGCGCTGGC CGTAATTTCT 
TCACTGCAAT GCCTTCATCA GCAAATGGGT TAACATGAAT ACCAAATACC 
GATACTTATC AAAGGCACOG GCTTTTATTA ATCGATTTGC ACCGCCACCA 
CAGGTTGGAA AATGAAAACG ACATTTTGCG GTAATTGACC TGCATCTTGC 
AACGTTGTAC AAAAAGCATT AATGCAGTTG TATGACCATC ATGTCCACAA 
CATGATCAGA TTGACTGCGA TAAGGCACAT CATTTTCCTC TAAAATAGGT 
TATCAGCTCT ATACGCTATC GTATGTGAGC CATTACCTTC TAAGTATGCA 
TTTCCAATGG GCAATCGTAT TTAATATTTA AACTATCTAA AAACGCTTTA 
TTGTTTCAAA TTCATGTAAG CTTAATTCAG GATGTTGATG TAAATGACGG 
TAACAAATTC TAATTCATTC ATAATTATCT^ ATCCTTTGTG TTAAATTACT 
TGTAACGtAT TTCGAAATTT GTGATCATAA GTTTATTCAA TGCTAAACAA 
ACATAATCGT ATCTCAACCT TGAAATTATT ATACGTTGAC GTCACTAGTC 
CTTAATGCTG CTACAATCTC TTTTTTAGTA TCTTGTACTT CAGAAGCTTG 
TTTGCAGGTG TACCAGCAAC AACTGCACCA GCTGGTACAT CTTGTGTCAC 
GCTX3CAACAA TAGCACCTTT ACCAACACGT ACACCTTCTA AAATAACTGC 
ATTAATACAT CATCCTCGAT TATAACCGGT GAAGCACTAG GGGGTTCAAT 
AATACTGCGC CAGCCCcTAC ATGTACATTT TTACCAGTTG TAGCACGACC 
GCATTCATAT CAATCATTGT ACCTTCGCCA ACGACTGCGC CAATATTAAT 
ATCATAACGA CAGCACCATC TTCAATAATG GCTTGTTCTC TAATAAACGC 
ATTCGTGCAT TCGTATTTGT TAAGTCTTTT AATGGAATAG CAGAATTGCG 
TCAATTTCTA TATCTTCGAA TTGACTACCA TATGCTTCGT AAAAAGGTTT 
GCTTCACAAA AGATTACTTT AGATTGTTCT GAACCAAATA CTTTAAAACT 
GTGATGCCTT CAAAATTACC ATTTAAATAT ACTTTTATTG GTGTAGACTT 
CTTATATATT GAATAATTTC TTCAGCTGTT AAATGTTGTA CCATAAAATA 
TTTAATATGT TTATAAGTTG TCAAACGTAT AAAAGCCGTT TGGTTTATTA 
CTGCTGCTTG TATTGCACCA TTCGCAAAAA TATCTTTTGA TTGTGCACGA 
GAATCGTTTC ATCAGTGCCA GCAAATAGAA CTTCATGTTC ACCGACAATC 
GAATAGAATG TATACCAATA TCTTGTGGCT GGCGTTTTTC ATTTAATTCA 
ACACAGGTGT TACATTTTCT TTCAAAGATA CGATCACATC ATACAATTTT 



1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 
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2640 

2700 

2760 

2820 

2880 

2940 
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3060 

3120 

3180 

3240 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 
TATGGCTCAT CATTAATGCA CGTATCGGGT AGCGITTACC AITGATAAGT 
TAGCACGAGT TCTTAAAATT CCATCGCCAT AACCGATATC AACTACAGCT 
TGTTTTTAGT CACTTCAAAG GCAAAGCTAT AACCGCAATA ATCACCAGCT 
GCACTTGAAT AACATGTGCT TTTAAAGTTA ATGACTGAAC TATATCATGT 
AACTATATGG TCTTGAACCG TATAACGCAA TACCTACACG CGCATGTX3TA 
GTAATAyyyg TCCTTCCCGA TAAAAACTCG CACTATTTTG AGCATGGATT 
GATAACCTTC AGATAAAAGT GCTTCAACAA TTTCCATCCA TTGTGAACGT 
AAtCTGACAC ATCGAATTCA TCAGCATATC CAAAATGGGk CCATAAACCA 
TTTTTGCATT TTGATTATGA TGGTGATCTT TCAATACTTC TTTAATTTCG 
TAAATCCAGA CCGATGTAAT AAATTTTCAA ATTCTAAGTG AACATGAATA 
CATTTTTATG GTTATAGTAA TATGTCAACG ACGGCAAAGT CATGTGTATT 
GGACTAAATC AAACTCGTAA ACTGCATTCA TTAAAAAGAT TGrtGCATCT 
GTCTAATTTG AATTGCTTCT CGTAGTGATG TTGTGCTAAA TGTATCTATA 
TAAACTGAGT TACAGCAAAT TCTAGGTCAT AGTGATATGC ATTaTTTTTA 
TTAATGGCTG ATTGTTTTTG ACTGTGATTG CATTTTGTAA AAATATTTTC 
ACC^GTTGC TGTCAATGTA TTACACCTCT TTGTAATTAT TTAATAAATT 
TTAACCACGT TTATTAACAC TTTTTCATCA AAATTTAAAT GTGATGTGTG 
ACAAAACCTT TATCTTCATT TCGTGTTCCT ATAAAAACAA AGTAAGCTGG 
TGACCATAAA AACTAAAATC TTCCCCAAAT AAGAATGGCG TTGGTTTGTC 
AAATCAGCTT TTATTAAGGC GTCCTCTATT TGAGTACGTA ATTTCGGACT 
GGGGGATAAC CTTCTGCAAA TTTAACTTCA CAATCTACAT TAAACAGAAG 
TCTGCTATCT TGTGCATTTG ATTTTTAACG ATTGTTAAAT CATCAATATC 
ATAGTACCTT CTAAATAGCC ATTACTTGGT ACAGTGTTAA TCGCTTCACC 
TGACCAATAT GAACAATATT TCGTTTCAAA CCGTTAAGGT GAAATTGTTG 



GCTTCATGTT 
AATTTTGTAT 
TGTACTTCGC 
TGATTCAGTG 
TGGTGGGGTA 
AGGTCGAACT 
TCAACATTAT 
CTAATAATCA 
TTTAAATCTT 
CCAGCTAAAT 
TGATGTTCAC 
GGAGCAAGTT 
CCTGCATGGA 
ACAACTGCCA 
TTATTTACAG 
TTCGTAAAAA 
CAAACCAGTT 
AGCTAGTTGT 
ATAGACATTT 
ATTGATTGTA 
CTTGACACTT 
ATATGTACGA 
AGCTTTAAAA 
AATTTGTGAT 
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180 
240 
300 
360 
420 
480 
54 0 
600 • 
660 
720 
780 
84.0 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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AGCGTTTGAC TATATGCGGT ACGATTAAGC CAACAAATCC AATTGGTCCT GCCACTGACA 
CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATGAT AATTCTAACT TTATTTATAT 



TT 

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 
TATTAGAAGG TCGTTCGGAT GAACAATTAA AAAATTTAGT TAGCGAAGTA ACTGACGCCG 
TAGAAAAAAC AACGGGGGCA AATAGACAAG CAATTCACGT TGTTATAGAA GAAATGAAAC 
CAAACCATTA TGGTGTGGCT GGCGTAAGAA AGTCAGATCA ATAATTCTTC ATAAGATGCA 
TGCCAATTAA TTCTTTGAAA ACGAACAAGG CGACTTCTAT CTGaGTATGA TAGAAATCGC 



CrreTTTATT TTTAATCrrC ATCTAAAAAG TCTTTAATAG CTTGTTTATT TGTGTTTTTA 
TTAATCTGTA ATGCACTACC ATCAGTATTT GTATTGACAT CyTCGTATGA GTTCTTGATT 
GGCaCAGTCA ATGACTTAAC ATCTTTTTCA CCTCGGATAC CaAAACTCAA ACCTGTTTGG 
AAAATCCCTG AATCAGGAAT GTrTGTATTC ACATAGCCTC TTAAAATACC TGCAACTTTT 
GGTAATTTAA CAACTGTTCT AAAATTAACC ATTTCTTTTT TCAATGTTTG CATCACTTGT 
TGCTGACGTC GCACGCGTCC GAAGTCACCT TCAGGGTCGT GACGGAATCT TGCATAACCA 
AGTAATTCIT TACCATTCAA CCTATGGTTA CCCTTTrTCA AAGATACACC AATATTTTTC 
GAoi^ATCTT TTTCGACATT AATTGGTACA CCITCAGGCA TTAATTCATC AATCATTTTC 
^0 TCAAATCCAG TAAAATCAAC TACTGCATAA TATTCAGGAT TAATTCCTAA ATTTTTATCA 

AGTGTTTTTC TAAGTAGCTC TGGACCACCT AAAGCGTATG CTGAATTAAT TTTGTGTTTT 
CCATATCCTG GAATATCTGC ATAAATATCA CGCATGACAG ACATCATTTT CATCTTTTTA 
TTGATAAAGT CATATTGAAC AACCATGATA GAATCTGTTC TTGATTGTCC ACCTTGTGCT 
TTATCTGCAC CGAGTACAAG AATAGAAATT nTACCATCAT TTTTTACTGG TCCATTAAAT 
TGATGTACTT TAACATCTTT CGCATGTTTC TTGGCATATT CTACACCGCT ATTGTAACTA 

50 

TGTACAATAT ATACAACTAA TCCCGATAAG TAAAATTACA ACAATCAGAA GAATGATAGG 



TAATT 



4020 
4080 
4082 
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AACTTGAGAA CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA CCAATTCGTA ' 2220 

AAAATGTCAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA CCAATTCTAA 2280 

CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA ATAGTTGAAC 234 0 

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATTAAAAAT GGCATTACGC 24 00 

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 2460 

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAATAATAA GATACCAATT AAAGACATCG 2520 

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 2580 

,5 TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAATT CTATTCAGTT AACAAACATA 264 0 

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTACAGTT 2700 

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760 

20 TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2 820 

CGTAATAATA TTAAGAAGTA TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 288 0 

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCACCA 294 0 

ATTAACCCCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TCTCGCAATA 3 000 

TGAGGTGCAA TTAATCCTAA AAAGCTAATA CCACCGACAA CTGAAATTGC GGATCCTGCT 3060 

AATATTACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 3120 

GTTGCAACAG CATCACCTAG ATTCAATACA TCTAATTGAT AACTCAATAA AATGATGATT 3180 

GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGACATATC ATGTCCATAT 324 0 

35 AGACTACCTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG -3300 

AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360 

CCna'AACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 3420 

AAGAATGAAC CTATAGGTAA TACAAACAAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 34 80 

ACAGCAGCTA AACTGGCACC TTTAGAAATA CCTATAACAT CAGGTGAGGC TAACGGGTTT 354 0 

CTTATTACAG CTTGTATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3 600 

AGTAATGTTC TAGGTATACG ATACTCATTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3 660 

TTAATAGCAT CAATCGGATG AATCATGACA GACCCTACAC ATAAACTTAT GAATATACTC 3720 

ACAATTAAAA GGATTGTGAT TAAACTATAA CGACGTATAA TTTTCGTTGT CATCATATTC 3 7 80 

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC ACCTACGAAT GATGTAACAA 3 840 

TCCCTACTGG TGATTCATAA GGATATGTAA TTAAACGACT TAATACATCT GATAGTAGTA 3 900 
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AGGCGTCTGA 


ATAATTTGCA 


ATAAAAATGC 


TATGATTGCG 


ATAAATAATA 


TTGAATGCGT 


420 




AAAGGTTCGA 


TGTCCAAAGA 


TCAATCTCAC 


AAAAAAACTA ATTACCTTAA ACCTTCTGCC 


480 


5 


AATCTTACTT 


TGAGTGTGAC 


ATATATCGGG 


TAATAAGCTA 


GCTAGAGTU vj 




54 0 




AACCGTAACC 


GACGAAAAAA 


TATCCGTTTG 


AA7VATATTGT 


GTTv5TTAGC-ij 




600 


10 


CATGCCGCAT 


GAAGCATGTG 


TTTTACCTGT 


CATATTTGTT 


CTCCTTTAAT 


& f'T^O ft ^ A 'T*'P*T' 


660 


TACCACATCC 


CTAACAAAAA 


CACGAACATA 


TTTTCGGGTT 


AAAATTCAl. 1 


jv T* n li r* 21 


720 




AATTTAAAAA 


AGTATCACAT 


AACTCTTGAA 


AACGATTACA 


AAATCGTTTA 


Tt» AXCj i A i i i 


780 


15 


ACAAAATATT 


TAAAGGATGT 


GTTTGAATAA 


TGGCAATGAC 


AGTAAAAAAG 


GAa AAlAAlo 


840 




AAGTGCGTAT 


TCAATGGAGA 


GTTGCTGATA 


TCAAAATTCC 


TACAAGTGAA 


ATTAAAAAiA 


900 




TTACACAAGA 


CCAAGATATT 


CATGCAGTTC 


CTAAATTAGA 


CAGCAAAGAT 


GTATCTTAGAA 


960 


20 


TCGGCTCAAC 


GTTTGGTAAA 


ACGAATCGCG 


TTATTATCGA 


TACTGAAGAC 


rf^ TV z'^/"* TV TV TT* A A 

CACGAA J. AUA 


1020 




TTATTTATAC 


TCAAAATGAT 


CAAAAGGTTT 


ACAATGAATT 


AACTAAATAA 


» *ii''f»/^^TV T»R IV A 

ATTGTATAAA 


1080 




AaAATCATTC 


ATGGTGAGGG 


CTTCATGAAT 


GAl'lTTTTTA 


ATTGATTCAA 


CACCCAGCAT 


1140 


25 


AAACAAATAC 


AAAAGGACAA 


CTGTTCCCAT 


AATTTTAACA 


GTTGTCCTTT 


1 1 M 1 YV ^ TV A ^ A 

TTCACATATA 


1200 




TTTATAACAA 


AAGATGTGCC 


ATCAAAGAAA 


TAATTGGTAG 


TGTAATGATT 


GTTCTAATCA 


1260 


30 


AGAAAATCAT 


AAACAATTTG 


CCGATGCTTA 


CAGGAATCTT 


CGAACCAAGT 


ATGAt- vaL. UAU 


1320 


CTACTTCAGA 


CAAGTATATT 


AACTGCGATA 


TACTAAGTGC 


CCCAATAACA 


TV IV A R 


1380 




TATCATTTTG 


TACACCTTCA 


ATTAATATAG 


AAGGTAAAAA 


CATATCGGCA 


AAA /^/"V*^ IV 'P & & 


1440 


35 


TAATCGTTTG 


AGAAGCCTGT 


GCCGCTTCAG 


GTATTTGCAT 


TAACTCTAAA 


AATGGAACAA 


1500 




AAGGTTTACC 


CAATATGACA 


AAAAAGGGCG 


TGTAGTTCGC 


AATAATGGTA 


GCAATAGTAC 


1560 




CAATACTCAT TACTACAGGC AAAATAACAA ACCACATATC AATGACTGTT 


TTTAATCCTG 


1620 


40 


ACTTAAAAAA 


GTCAATAACG 


CCCGGTGCTT 


TAATACCTAC 


TTCTGTTGCA 


GTATCAAAGC 


1680 




CATGTCTCAA 


TGCCGTCTTT 


CCTTCTGGCA 


ATGCCTCAGT 


ACGCGCACTT 


TCAGGTACCT 


1740 




CCTTAGCATA 


CTCATCAGGA 


ATTTTATTTA 


AAGGCCAAAT 


TCTTGGCATA ATGACTGCTG 


1800 


HO 


CAACGAGGCA 


GGATACTATC 


ACTGATAAAT 


AGAAAGCAAA 


AAATTGATTT 


TGCATGTGCA 


1860 




CTGTTTCAGC 


AACTACAATT 


GCAAAGGTGA 


TAGAAACTAC 


ACTAAATGTC 


GTTGAAATAA 


1920 


50 


CTGTTGCCTC 


ACGACGAGAA 


TAATATCCTT 


CACCATATTG 


TCTACTTGTA 


ATTAAGACAC 


1980 


CAACAGTTCC 


GTCTCCAATA 


AATGATGCTA 


AATTATCTAC 


CGTCGAACGT 


CCTGGCAATG 


2040 




TAAATAAAGG 


TCTCATAACC 


GGTCTAAAAA 


TAGGACCCAA 


CATCTCTAAC 


AAACCGTATT 


2100 
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TGCTGTTTCT TGAAAGTTGG TATTTGAAAG CATCCTTTTC GCAAAAATTT GGAGCAGTGG 6420 

AAGAAAGTAA AAGTCAATTC AAAGATGAAG TGTATTCXX:T ACTAAATATA TTTTTGAAGA 64 80 

AATAATTTTT GTTACTAGTT TGTAATAATT AACTTACTTT TGTAAAAAAA GACATGAGAT 6 54 0 

TATTTTTTTA AATCTATATA AAGTTGACAA TACAAATCGA TATTGAGAAT ATTAAGATGT 6 600 

ATATGAATTT TATAAATTAA ATGCAATACA TTAATATAAA TATCAATTGT TGCAAAATAC 6660 

GATTTGTTCA ATGATTTGAT AATATTATTC TTTATATTTG TGAATGGTTA AGTTTGTCTT 6 720 

TGAACATATT ATAAAAGTGT AATGTTCCCC TGAAAAGAAT AAGTTGTCAT CTAATTACAG 678 0 

y5 GAAATCCGCA TAAATTAGAT GAAATGGAAA GTAATAAGTA ATAATTTATT GATAAGCGCC 684 0 

TATGTGATGG TAAATCATGA CATAGGCGCT TTTTTTTATA AGTTAAAAAT GTAAATAAAA 6900 

ATTATATAAA TTACCCACAT CTTTTTAAAA GGTGTGGGCT TTATTATCAT TAACCCAACT 6960 

CACAGTGACG GGTTACGCAA GGTATTGAAT TACCGAGTAC GGGCACGCTC GGTGTTGTAA 7020 

AGAGCAAATA ATCAAGTAAT GATGATGCTT CTACTCGATT ATAAGAAAGC CATGATAGAG 7 080 

TACGATGGTA TCTAGTTTTA TTATTAATAG GTTTGGATAT TTAAAGTTGG ACAATATTAT 714 0 

ATCTTGTGCA AAAATATAAA TAAGTTATAC ATAATGGTAG AGAATCATGA TATAATTTTA 7200 

AACGATAAAA TATTTATATA AATAATTAGA GaAAATGTAG TTGTGTATGT yTTGTGGtCG 7260 

TTAAACTAGA TATAATTGTC CGATTTATAA AACATACATA ATGAATACaA TGATTGATTA 732 0 

TGTGGAGGAA ACCATGAmAG AmAAGTTTgA TTTAGTAAAA CTATTAAATA TTCTAAAGAA 73 80 

GAATATTAAA TTATTGCTTA TTTTACCGGC AATATGTCTT GTAGTAAGTG 74 30 

3S (2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 082 base pairs 
r (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

ATTGTTACTC ATTATTTTTT CATATTCACA CAAATGATCT TGTTTATATT TAGCTAATTG 60 

ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTGTAAC CTTTAATAAT TTATTTTCTA 120 

TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 180 

AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 24 0 

TTGAATCGGT GTCTTCACAT TAAACGGTAA gGAknAAAAT AATTTCACGC CTTTTGGTGT 3 00 
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TTGCGTAACC ACCTTTCTTA CGTTTTAATG CTTTTGGAAA TGCGACAAGA ACTACTGCTG 
CGTTAATAAT CCAGTATACT GTCGGATACC AACTTACAAA TATGAGTCCA GCCATATTCT 
TTTTCTCGTA GCGACTATCA ATAAAGAGTG CGACTGTAAA TTGAATAACG TTTATAAAAG 
TCATAGTAAA TGATGATAGT AGAAATATTG AAAAACTATA TGTCATAAAT GTATAGTCTA 
AGAAGTTTGC TGTTATGAAC AAATAGCCTA AATATAGAAG CACTATATAT ACCCATAAAA 
TCGAGATGAT TTGCTCAAAC ATCAAAATAT ATAAAGGAAA CCTTTTCGTT TTCATTGTGC 
TAAAAAAGTC TCGTAGTAAT ACTTCGTGTC CCCCTTGAGC CCATCTCACG CGTTGCTTCC 
AAAGACCTCC CAATGTTTCT GGAACCAACA TCCAACACAT GGCAAGCGGT TCATACTTAA 
TACGATATCC ACGTAAATGC AATTTCCAAG AAACTGCAAT ATCTTCGGTA ATCATATCAG 
TATCCCAGTA GCCAACX3TCG ACAACTGCAC TTTTTTTAAA TAGAGTGAAG ACACCCGAAA 
TAGTATTGAc TGCGCCAGCA AGTGTCTGAC TtCGCTTAAT ACAGCCAATT AAACTTGCAT 
ATTCTATCGT TTGAATTTTA CCTAAAATAG AACTCTTATT TCGAATTCTA GGATTACCTG 
TAACTGCACC AAGTTTTGGA TCATGTTTGA AATTCTCAAT CATATAATAT GGTGCATCTT 
GATCAACGAT AGTATCTGCA TCCAAGCACA TTACATAATC ATATGAAGCC TGTTTAATGC 
CTTGATTGAG TGCGTTGGCT TTACCTCTGT TTTCTTGTAA ATCGACGAAA ATAAAGTCAT 
TATTTTCTTT GATTTTATAG ATGAGTTCTG CTGTATTATC TGAACTTCCA TCATTAATGA 
TAATAATTTC TTTCTTCTCG TATTTGAGTG CAAGAACATT AGACAACGTA TCTTCAAyCG 
TTTCACTTTC GTTATAACAG GCAAGTAAAA ATGTAATGCC TTCTAATTCA TCCACATTTA 
TGTCAGGCTT CTTGTTCAAT GAATATCTAA TTTCTCTGGT AAAATAGAAA TAAATTGAAC 
CGACAATCCA GTAAATAGAC ATAAATACAG GATAAAAAAG CAAAAAGTTA AAAAATTGCA 
ATT-finTTAC CTACCTTTCG TTAGTTAGGT TGTAAGCCAT ATGGTAATTG ATAGTATTTT 
AATTTGCAAT AGATTGTTGT TATAATTAAA CGGAAATATT TGTAATTGCA ACTTAATTTT 
CCTGTAACAT AGTGTGATTA ATTTTCAGTA GGGGGTTATA AAAATTGAAG GATAAGATTA 
TTGATAACGC AATAACCTTA TTTTCAGAGA AGGGGTATGA CGGTACAACA CTTGATGATA 
TAGCTAAAAG TGTAAATATA AAGAAAGCGA GTTTATATTA CCATTTTGAC TCGAAAAAAA 
GTATTTACGA ACAAAGTGTT AAATGTTGTT TTGATTACCT TAATAATATT ATTATGATGA 
ATCAAAATAA ATCX5AACTAT TCAATTGATG CTTTATATCA ATTCTTATTT GAGTTTATTT 
TCGACATCGA AGAAAGGTAT ATTAGAATGT ACGTTCAATT ATCTAATACG CCTGAGGAAT 
TTTCTGGAAA TATTTACGGA CAAATACAAG ATTTAAATCA ATCATTAAGT AAAGAGATAG 



4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 
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TCTAAGAAAT TTAATACACG TTCGTAGTTA TAACCCATAT ATGCACCTAA GAAGAAATAA 
AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATCGAT AATAGTGTAG CACGGTATCX5 
TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 
AATAATAATA nTTACTGTT GAATAGGTTA TAGTTAATTT TAAAAATGAT ATAACTCAAA 
ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATIT 
TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAATG ATTCACTATA ACTGTAAAAC 
AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACGC GTGTAGTTAA GTATCTATAG 3180 
GTGACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 324 0 

CCAAAAATCA CAATATTACG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 
TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGC ACATATAATA 



10 



35 



40 
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2820 
2880 
2940 
3000 
3060 
3120 



3300 
3360 

GCACX5TAAAT ATACGAGTTC AAGTCTAATC TTTTTCATGG AATCCGTCCC ATCTCTTAAT 34 2 0 



3480 
3540 
3600 
3660 
3720 
3780 



TAAATGCTCA AAAGCATCAT CACTAATTAA TATTCTAGGG ATGTAATAAT CATTGGAGTT 
CGGAGTGACT GCTTTTTCCT CTAATGAAAA ACCGTATTTT AACCCAGCTT TTTTGATTAC 
2s CGGTAATTTA TCGTCATTCA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTTTTT 

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAGCTTCAGA 
AGCTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 
30 VTCPJKATTQC CATAACCCAG TTTTATACAT TTCTTTTAGT TCTTTTTTAC TAATCATATC 

GAGGTTGTGA AAGTTTTCTT CCCCAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 3 84 0 

TTTATATTTT TTTAAGATTG GATAAGCATT TTCATAAATA GTTTCATCCA TATCATCAAA 3900 

GTTAATCCAT ACACTTCGTT TTGGaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 3 960 

CAAQGTTAAA AATTTAGCAT CATGTGATTT TAGCCATTTT ATTTGAGATT CAAATTGTGA 4020 

TTGACTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 4080 
CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 
TTTCAGTTTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA TGATGACCAT CCAGTGTGCT 

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACGAT 42 6 0 

TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 4 32 0 

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 43 80 

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 444 0 

AGTAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAG TGCTGTTTCT 4500 
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TCGTGTAGAT 


TATGGTGCAG 


CACATGCAGC 


AAAATATGGA 


CATGAACGTT 


ATGGAAAAAC 


1020 




ATACGAaGGA 


ATTTACAAAG 


ACTGGAAACC 


AGGACAGAAG 


GTACACCTAG 


TTGGACATAG 


1080 


5 


TATGGGCGGT 


CAAACGATAC 


GTCAACTAGA 


AGAATTACTG 


CGTAATGGTA 


ATCX5TGAAGA 


1140 




AATAGAGTAT 


CAAAAGAAAC 


ATGGTGGcGA 


AATTTCTCCA 


CTATTCAAAG 


GTAATCATGA 


1200 




CAATATGATT 


TCATCAATTA 


CTACTTTAGG 


AACACCACAT 


AATGGTACAC 


ACGCATCAGA 


1260 


10 


TTTAGCTGGT 


AATGAAGCTT 


TAGTGAGACA 


AATCGTATTT 


GATATCGGTA 


AAATGTTTGG 


1320 




TAATAAAAAT 


TCAAGAGTAG 


ACTTCGGGTT 


GGCTCAATGG 


GGTCTAAAAC 


AGAAGCCAAA 


13B0 




TGAATCATAT 


ATTGATTATG 


TCAAACGCGT 


TAAACAATCT 


AATTTATGGA 


7U\TCAAAAGA 


1440 


15 


TAATGGATTT 


TACGATCTGA 


CGCGTGAGGG 


TGCAnCAGAT 


TTAAATCGTA 


AAACGTCGTT 


1500 




GAACCCTAAC 


ATTGTGTATA 


AAACATACAC 


TGGTGAAGCA 


ACGCACAAAG 


CATTAAATAG 


1560 




CGATAGACAA 


AAAGCAGACT 


TAAATATGTT 


TTTCCCATTT 


GTGATTACTG 


GTAACTTAAT 


1620 


CGGTAAAGCT 


ACTGAAAAAG 


AATGGCGAGA 


AAACX3ATGGT 


TTAGTATCCG 


TTATTTCTTC 


1680 




TCAACATCCA 


TTTAATCAAG 


CTTATACAAA 


AGCGACAGAT 


AAAATTCAAA 


AAGGCATTTG 


1740 


25 


GCAAGTGACG 


CCTACAAAAC 


ATGATTGGGA 


TCATGTTGAC 


TTTGTAGGAC 


AAGACAGTTC 


1800 


TGATACAGTG 


CGCACAAGAG 


AAGAATTACA 


AGATTTTTGG 


CATCATTTAG 


CAGACGATTT 


1860 




AGTGAAAACT 


GAAAAGCTGA 


CTGATACTAA 


GCAAGCATAA 


TTTATAAAGT 


AAAGGGAGGA 


1920 


30 


ATTAATAATG 


ACTGCAGACT 


TCTTTCAATT 


AATCGGATCA 


TTATTTAGAA 


TTCTAAAAGA 


1980 




ATTATTCAAG 


TAAAACATTG 


GCGAGGCCCC 


AACATAAAGA 


ATTTCGAAAA 


GAAATTCTAC 


2040 




AAACAATGCA 


AGTTGGCGGG 


GCCCCAACAA 


AGAAGCTGGC 


GGAAAGTCAG 


CTTACAATAA 


2100 


35 


TGTGCAAGTT 


GGCGGGGCCC 


CAACATAGAA 


GCTGGCGGAA 


AGTCAGGTTA 


CAATAATGTG 


2160 




CAAGTTGGGG 


TGGGACGACG 


AAATAAATTT 


TGCGAAAATA 


TCATTTCTGT 


CCCACTCCCA 


2220 




TTGGCATTTA 


CX3AAGTTTAA 


ATGTGCAATT 


AGAATATATG 


TATAACAATA 


TTAAACACGC 


2280 


40 


GGTAAAACGA 


AGTCAGTCAA 


TTCAAACTGA 


TTTCGCCCAC 


CGCGTGTTTT 


TAACATAGCT 


2340 




TAATAATTAA 


TAAGCATTAA 


TGTTCAATTT 


ATATGGTTGT 


TTTCCAATAA 


TAAACCTAAA 


2400 




GATATAGAAT 


TCACGCAATA 


TCATGCCGAC 


ACCTATACAT 


AATCCTAAAA 


TGAATAGTAG 


2460 


45 


TGATATCGCT 


AGAAAGACCA 


TTGTATTATC 


CTCAAATATA 


TTTGTATATG 


CAAACAATGA 


2520 




GTCTAGAATG 


ATTGGATGTA 


ATAAATAAAT 


AAAGAATGAG 


AAAGCACTAA 


TCATTTGAAT 


2580 


50 


CGTATTAAAT 


AACATTGTTT 


TAAAATGCGT 


GCAAATACCC 


AAGATAACAA 


TAAACATAAT 


2640 


ACTATTATAT 


GGTGTTAATG 


AATATGAAAA 


GCTGGTAACG 


TTCCAATAGT 


CTCCaTTTGT 


2700 
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ATCTCGTTAT CATCGTAAGC GTCATTATCA ACTTGTGTTT CTTGCATTTC TTGTAATTCG 624 0 

GCAACAAACA CTTCTTGATC TCCTCGATCA CGGCTCTTAC GATTAATACG TGTTTTATAT 63 00 

TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 63 60 

TTTCGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA TTGTTACTTC AATTTTAGTA 6420 

GCTGAATTTG AATAAGTTTT AACTTTAACA TGCX5CCACTG CATTTGGTAC GTCATTAAAA 6480 

TAACGTTCCA ACTTACCAAT TTTTTCCkCA ATATAGTTGC GAATAGCATC TGTGATAGTG 6540 

AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCTTAAA CCTCTTTATn 6600 

GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GCAAA 664 5 
(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2$ . (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

CAGTTCCAGC ACAtCTATTG GGGATCAACA AACTAGGGAA AATGCTAATT ATCAACGTGA 60 

AAACGGTGTT GACGAACAGC AACATACTGA AAATTTAACT AAGAACTTGC ATAATGATAA 120 

^0 AACAATATCA GAAGAAAATC ATCGTAAAAC AGATGATTTG AATAAAGATC AACTAAAGGA 180 

TGATAAAAAA TCATCGCTTA ATAATAAAAA TATTCAACGT GATACAACAA AAAATAACAA 240 

TGCTAATCCT AGCGATGTAA ATCAAGGGTT AGAACAGGCT ATTAATGATG GTAAACAAAG 300 

TAAAGTGGCG TCACAGCAAC AGTCAAAAGA GGCAGATAAT AGTCAAGATT CAAACGCTAA 360 

TAACfiATCTA CCTTCACAAA GTCGAATAAA GGAAGCACCA TCATTAAATA AGTTAGATCA 420 

AACAAGTCAA CGAGAAATTG TTAATGAGAC AGAAATAGAG AAAGTACAAC CACAACAAAA 4 80 

TAATCAAGCG AATGATAAAA TTACTAACTA CAATTTTAAC AATGAACAAG AAGTGAAaCC 54 0 

TCAAAAAGAC GAAAAAACAC TATCAGTTTC AGATTTAAAA AACAATCAAA AATCACCAGT 600 

AGAACCAACA AAGGACAATG ACAAGAAAAA TGGATTAAAT TTATTAAAAA GTAGTGCAGT 660 

AGCAACGTTA CCAAACAAAG GGACAAAGGA ACTTACTGCA AAAGCGAAAG ATGATCAAAC 720 

GAATAAAGTT GCCAAACAAG GGCAGTATAA AAATCAGGAT CCTATCGTTT TAGTGCATGG 780 

TTTCAATGGG TTTACAGATG ATATTAATCC TTCAGTGTTA GCTCATTATT GGGGCGGTAA 84 0 

TAAAATGAAC ATTCGCCAAG ATTTAGAAGA AAATGGTTAC AAAGCTTATG AAGCAAGTAT 900 
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TTTATTTGTC 


GGAATTTGAG 


TTACTGTCAT 


GTTATAAATA 


TTTCTAAATT 


CTTCTTCTTC 


4440 




AGTTTTAGCT 


GTACCTGTCA 


TACCCGCAAG 


TTTATTGTAC 


ATTCTGAAAT 


AGTTTTGGAA 


4500 


5 


TGTAATAGAC 


GCCATAGTTT 


TAGATTCATT 


TTGAATTTGA 


ACGCCTTCCT 


TCGCTTCAAT 


4560 




AGCTTGGTGT 


AAACCTTCCG 


AGAAACGACG 


GCCTGGCATT 


GTACGTCCTG 


TAAATTGATC 


4620 




GACAATTAAT 


ACTTCGCCAT 


CAACAACCAT 


ATAGTCTACG 


TCACGTTGTA 


ATGTAACGTG 


4680 


10 


CGCACGTAAA 


GCTGTGTTGA 


TATGACTAAT 


AACATCAACA 


TTTTGTACAT 


CATATAAGTT 


4740 




TTCAACTTTG 


AACATACGTT 


CAGCTTTATC 


CGCACCTTGT 


TCTGTTAAAT 


GTACAGCTTT 


4600 


15 


CG'i-rrrrrcA 


TCGTATTTAT 


AATCTTCGTC 


CTGTTTTAAC 


ATTTTCGCAA 


AAACATTTGC 


4860 


TTGTGTATAA 


AGTGACGTTG 


ACTTTTCAGC 


TTCACCAGAA 


ATAATTAATG 


GCGTACGTGC 


4920 




CTCGTCGATT 


AAAATTGAGT 


CAACCTCATC 


AATGATTGCA 


AAATGTAATG 


GACGCATTAC 


4980 


20 


TCTATCTTCA 


GAATAATTCA 


CCATGTTATC 


TCGTAAGTAA 


TCAAAACCTA 


GCTCATTATT 


5040 


AGTACTGTAA 


GTAATGTCTT 


GTGCGTATGC 


TTCACGTTTT 


TCTTCTGTCG 


TCTTACTGTT 


5100 




TAAGTTTAAT 


CCGACAGTCA 


AACCTAAGAA 


GTTATATAAC 


TCAGCCATTT 


CTTCACTTTG 


5160 


25 


AACACTTGAT AAGTATTCAT 


TGACTGTAAT 


AACX3TGAACA 


CCTCTACCAG 


CTAATGCATT 


5220 




TAAGTATGTT 


GGCATTGTCG 


CTGTTAATGT 


TTTACCTTCA 


CCTGTTCTCA 


TCTCAGCGAT 


5280 




ATCACCTTTA 


TGAATTGCAA 


TACCACCCAT 


AATTTGAACT 


TTATATGGTG 


TCATATTGAA 


5340 


30 


TACACGTTTA 


GAGCCTTCTC 


TAACAAGTGC 


ATATGCTTCT 


GGTAAAATTT 


TATCTAAATA 


5400 




ATCATTTTGC 


TTTTTGACAT 


TATCAATGTC 


AGCTAATTCT 


GTTTGGAATT 


GTTTCGTTTT 


5460 




ATTACGAATT 


TCTTCATCAG 


TTAAAATTGC 


CGTTTTTTCT 


TCTAAAGCGA 


TTACTTTATC 


5520 


35 


AGCAAGTTTA 


CCTAACTGTT 


TAATTTCTTT 


ATTATTGCCA 


TCAAGAATTT 


TTGATAAAAA 


5560 




TCCC^TTTCG 


TTCGCTCCTT 


TAGCTAAAAA 


ACTGTTTGGC 


CTACAACAAT 


ATATCTTATC 


5640 




ATTTATAGTT 


AGAAAATTAT 


ACTTATTTAC 


TCATTTGTAG 


AATCAATATA 


AATATATTTA 


5700 


40 


TGACATACTT 


CATTCACATT 


CTGTTGTCAA 


CAAGTTTATC 


ACTAATAAAT 


ATATTCTCAA 


5760 




TACGCAATTA 


TACTTCCTAA 


TAAATTATAT 


TATAAATATT 


TTACGATTTT 


CGACTCX3GAC 


5820 


45 


TATACAATAG 


ACTGACATAC 


TATTATTAAC 


TTAACATTCA 


AATATATACA 


TCCATTAACA 


5880 


TTAGCATAGT 


CACTATGTTT 


CATTCAACAA 


ATTACATTAT 


CGAACTATGA 


AATAGTCATA 


5940 




ATTTGCTTTT 


GGAGTATAAA 


AAAGCACTTG 


TGCAAAAACA 


CAAGTGCTTT 


AAACTTAATT 


6000 


SO 


TATTGTTCAC 


TAGTTTGAAT 


CAAGCCATAT 


TTACCGTCTT 


TACGGCGGTA 


AACGATACTT 


6060 


GTTCCATCAG 


TTTCTCTGTC 


TGTGAATACA 


AAGAAGTCAT 


GACCTAATAG 


ATTCATTTGT 


6120 
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TTGCGCTTTC 


GTTTGGTTAT 


CCCAAAAATT 


AGGTTCTGCC 


ATCATTTCTT 


CATATTCTTG 


2640 




AATATTAGTT 


TCTTTGTTCT 


CTAAGTCAAA 


GAGACCCCCT 


AATTTGTGTT 


AAATCTTGAT 


2700 


5 


TATACTTATC 


TATATTTCGT 


TTGATTTCTG 


ATAATTCCAT 


AGcATTCGCT 


CCTATTTATA 


2760 




TTTCAATTCA 


AGTCATTGAT 


TTGCATCTTT 


TATAATGCTA 


AATTTTAACA 


TAATTTTGTT 


2820 


10 


AAATAACAAT 


GTTAAGAAAT 


ATAAGCACAC 


TGACAATTAG 


TTTATGCATT 


TATTGTTAAA 


2880 


AATCAGTACA 


TTTATCATCG 


ACATATGCCT 


AAACCGATTT 


TTTAAAACTA 


AGTACATAAC 


2940 




AACGTTTAAC 


AACTTCTTCA 


CA'i-rrrrrAA 


AGTATTTAAC 


GCTTGTAAAA 


TAAAAAGACT 


3000 


15 


CCTCCCATAA 


CACAAACTAT 


AGGTGTTTAA 


TTGGAAGGAG 


TTATTTTATA 


TCATTTATTT 


3060 




TCCATGGCAA 


TTTTTGAATT 


TTTTACCACT 


ACCACATGGA 


CAATCATCGT 


TACGACCAAC 


3120 




TTGATCGCCT 


TTAACGATTG 


GTTTCGGTTT 


CACTTTTTCT 


TTACCATCTT 


CAGCTGAAAC 


3180 


20 


GTGcTTCGCT 


TCACCAAACT 


CTGTTGTTTT 


TTCACGTTCA 


ATATTATCTT 


CAACTTGTAC 


3240 




TACAGATTTT 


AAAATGAATT 


TACAAGTATC 


TTCTTCAATA 


TTTTGCATCA 


TGATATCAAA 


3300 




TAATTCATGA 


CCTTCATTTT 


GATAGTCACG 


TAATGGATTT 


TGTTGTGCAT 


AAGAACGTAA 


3360 


25 


GTGAATACCT 


TGACGTAATT 


GATCCATTGT 


GTCGATATGA 


TCAGTCCAAT 


GGCTATCAAT 


3420 




AGAACGAaGT 


AAAATCATAC 


GCTCAAACTC 


ATTCATTTGT 


TCTTCTAAGA 


TATCTTTTTG 


3480 




ACTTTGATAT 


GCTGCTTCAA 


TCTTAGCCCA 


AACGACTTCG 


AAAATATCTT 


CAGCATCTTT 


3540 


30 


ACCTTTGATA 


TCATCCTCTG 


TAATGTCACC 


TTCTTGTAAG 


AAGATGTCAT 


TAATGTAGTC 


3600 




GATGAATGGT 


TGATATTCAG 


GCTCGTCATC 


TGCTGTATTA 


ATATAGTAAT 


TGATACTACG 


3660 




TTGTAACGTT 


GAACGTAGCA 


TTGCATCTAC 


AACTTGAGAG 


CTGTCTTCTT 


CATCAATAAT 


3720 


35 


ACTATTTCTT 


TCGTTATAGA 


TAATTTCACG 


TTGTTTACGT 


AATACTTCAT 


CGTATTCTAA 


3780 




gata£gttta CGCGCGTCGA 


AGTTATTACC 


TTCTACACGT 


TTTTGTGCTG 


ATTCTACAGC 


3840 


40 


TCTTGATACC 


ATrrrixiATT 


CAATTGGTGT 


AGAGTCATCT 


AAACCTAGTC 


GGCTCATCAT 


3900 


TTTCTGTAAA 


CGTTCAGAAC 


CAAAACGAAT 


CATTAATTCA 


TCTTGTAATG 


ATAAATAGAA 


3960 




GCGACTATCC 


CCTTTATCAC 


CTTGACGTCC 


AGAACXSACCA 


CGTAACTGGT 


CATCAATACG 


4020 


45 


ACGAGATTCA 


TGTCGCTCTG 


TACCTATTAC 


TGCTAAACCG 


CCTAATTCCT 


CTACGCCTTC 


4080 




ACCTAATTTG 


ATATCTGTAC 


CACGACCAGC 


CATGTTAGTG 


GCAATAGTAA 


CGGCACCTTT 


4140 




TTGTCCAGCG 


CCTGCAACAA 


TTTCAGCTTC 


ACGTTCATGA 


TTTTTCGCAT 


TTAACACATC 


4200 


50 


ATGACGGATA 


CCACGTTTTT 


TAAGTTW^TT 


TGAAATATAT 


TCAGAAGTCT 


CAACTGCAAC 


4260 




AGTACCTAAT 


AGCACTGGTT 


GCCCTGCCTT 


GTGTTTTTCA 


ACAACATCTT 


CTACTACTGC 


4320 



55 



3NSCXX:iD: <EP 0786519A2J_> 



1056 



10 



IS 



20 



EP 0 786 519 A2 

TACATTAACT GTCTTAGTTA CTGCTTGCTT AGGTGCTTGC TTAACTACTA CTTTTTTAGA 840 

TGCTTGTTGT ACAGGTTGTT TTACTACCTT TTTAGCTTGG CTTGCTTTTC TTACTGGTGA 900 

TTTAACCGCT TTAGTTTGTT TCACTTTATT TTGAGGCACA AGTGAAATCA CGTCACCAGG 96 0 

AAAAATTAAA GGTGTTACAC CAGGATTGTA TGAATATAAT TGATTCAACG TTAAGTGATG 1020 

CTCTAAAGCA ATCTTATATA ATGAATCGCC AGCAACTACT GTATAAGTTG TCGGTGATTG 1080 

CGTTTGTGCT TGAACATTTG ATACATAATT ATGTTGAACA GGTGTTTTTA CTTGTGTGCC 114 0 

ATGTTGTTGT GCATGTGCTG CATTATTTAA AGCTAAAAAA GCTAACACTG ACGAAACCGT 1200 

CACTGTAAGA GATTTTTTCA TCTTGCTGTC ATTCCTTTGC TGTTAGTATT TTAAGTATGC 1260 

AAATACTATA GCACAATACA TTTTGTCAAA AGCTATTGTT ATAACGATGT AATCAAATGG 1320 

TTAACAATAT AAAAAGAATA CAACCTTTTA TCATAGTGTA AAATGTATTC ATACCATGTA 1380 

ATTGAGAACG TTTTCAATAA TTAATTCAAT ACCTTGAAAA TCGCCATAGG TAATATTACT 144 0 

AAATGCACAC TGCATATGTT GTTTTAACAA ACACAACTTT TAAAAAATAT ATTCTAACTC 1500 

TATCTACCGA ATTGTACTTA AATATTCATA AACAAATCAT ATTCCAAAAT CTAATTTACA 1560 

ATTTATTTAG CTACCTTTAA AAAAACCAAA AACCGACGCC CTTTTAGAGC CTCGGTTTTA 1620 

AAATATATAT TAATCGTGCG ACATTGTCTG TCTTAAATAT GATTCGATAA ATGGTCCAAT 1680 

GTCTCCATCC ATCACTGCAT CAACCTTACC TGTTTCTTCG TTCGTACGAT GATCTTTCAC 174 0 

^ CATTGAGTAT GGATGGAAAA CATATGATCT AATTTGGCTT CCCCAGCCGA TTTCTTTTTG 1800 

TTCGCCACGA ATTTCAGCCA TTTCACGTGC CTGCTCTTCC AATTTTAATT GATATAATTT 1860 

AGACTTTAAC ATTTTCATAG CTGCTTCACG GTTTTTAATT TGAGAACGTT CATTTTGGTT 1920 

ATTAACAACT ATACCTGAGG GGTGGTGGGT AATTCGTATT GCCGATTCAG TTTTGTTAAT 1980 

ATGCTGACCA CCTGCACCAG AAGCTCTGAA TGTATCAACT GTAATATCAT CCGGATTGAT 204 0 

TTCAATCTCT ATTTCATCAT TATTAAAATC TGGAATAACG TCGCATGATG CAAATGATGT 2100 

ATGACGACGT CCTGATGAAT CAAATGGAGA AATTCGTACT AGTCGGTGTA CACCTTTTTC 2160 

AGCTTTTAAA TAACCATAAG CATTATGCCC TTTGATGAGC AATGTTACAC TTTTAATCCC 2220 

CGCTTCATCC CCAGGTAGAT AATCAACAGT TTCAACTTTA AAGCCTTTCT TCTCACAATA 2280 

ACGTTGATAC ATTCTAAATA GCATATTAGC CCAATCTTGA GACTCCGTGC CACCTGCACC 234 0 

AGGATGTAAC TCTAGAATTG CGTTATTGGC ATCGTGAGGC CCATCTAATA ATAATTGCAA 2400 

TTCGTATTCA TCCACTTTAG CCTTAAAATT AATGACCTCT TGCTCTAAGT CTTCTTTCAT 2460 

TTCTTCATCA AATTCTTCTT GTAATAAATC CCAAGTAGCA TCCATGTCAT CTACTTCTGC 2520 
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CGCCTTTTTT CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT " 108 0 

ATTTTTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTrrTCG CGGGCTTTTT 1140 

GCTCTTCTTT ATGACGCTTT TCGATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200 

CTTTACGATT TTGTTTAGCT AATTTTTCGC CrTTTTTCTC AATATAGGCA GGATCATGTT 1260 

CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 1320 

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTGCTGCAGA TGTTTTTGAA GCAACTTTAC 1380 

TTGTAGCATC TGTAACTTTT TGTACGTCCG GATGTTGTTT GATACGTTTA CGTTCAACAA 144 0 

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT 1500 

CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 1560 

TAAACGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCC 1620 

20 AATTAnAAAA A ^^^^ 

(2) INFORMATION FOR SEQ ID NO: 259: 

{i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6645 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

CGAAATCATG ATTTAATGCT TTTTCATATA AGCTTTTCCA ATTAATCTTT CGTCCATGAT 60 

ATTCTTCAAC TGTTGCTAGA TATTGTGCAA TTTTAGTTAC TTTAAAGGAG TGTGCTGCAA 120 

CaTTGTGkTC mAAATATTTA AATTTTCCaG GtAATCTTAT AAGTCTTTCC aTATCTGATA 180 

ATCT^TTAAA ATATTGATGT ACACCCATTT CAATTACCTC CTCCATTAAT TAATCATAAA 240 

TTATACTTTC TTTTTACATA TCAATCAATT AAATATCATT TaAATATCTT CTTTaTATAA 300 

CTCTGATTAA ATGATACCAA AAAATCCtCT CAACCTGTTA CTTAAACAGG CTAAGAGGAT 360 

AGTCTTGTCT TGATATATTA CTTAGTGGAT GTAATTATAT TTTCCTGGAT TTAAAATTGT 4 20 

TCTTGAAGAT TTAACATTAA ATCCAGCATA GTTCATTTCA GAAACAGTAA TTGTTCCATT 480 

AGGGTTTACA GATTCAACAA CACCAACATG TCCATATGGA CCAGCAGCTG TTTGGAAAAT 54 0 

AGCGCCAACT TCTGGTGTTT TATCTACTTT AAATCCTGCA ACTTTTGCTG CGTAATTCCA 600 

so GTTATTTGCA TTGCCCCATA AACTTCCTAT ACTTCTACCT AATTGTGCAC GACGATCGAA 660 

AGCATAATAT GTGCAGTTTC CATAAGCATA TAAGTTTCCT CTGTTAGCAA CTGATTTATT 720 
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rrrAAATCAG TAATTACTTG ATGTACTAAt tCGcACCATG TGACTCATAA GTTTTAATGT 3240 

TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 3300 

CAGCACGATC CAGATTGAAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 3360 

TACTAAACAT TTAAAATCCT CCATTTAAGC GATTGTCACC AATCACATTC AGTTATAATT 34 20 

TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 34 80 

GTCAGTATAA TTTTAGTGTA CTGATATTAA TTTCAAAAAT GCCTCACAGT AAACAATTTA 3540 

CTGTATTTGC CCTTATA ^SSl 
(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOI/X5Y: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

AACTATACAT TTCGAAAAAT TCTTCTAGTG AACCTGCGCC ACCAGGAGCC ATGACAAATG 60 

CATCTGCAAG TTCTGCCATT TTATTTTnAC GTTCATGCAT AGAATCAACT AAAATTAATT 120 

CAGTTAAACG TTGGCTTGTG ATTTCATGTT CATCTAACAT TTTAGGCATG ACGCCAATAG 180 

CTTTGCCGCC ATGATCTAAT ACACCATCTT GaATGGCACC CATAATGCCA ATTGACCCTG 240 

CACCAAATAC TAATTCATAA CCTTGTTCAG CAAAATATTT ACCTAAATCG TATGCTTTTT 300 

GTACATATGA AGGGTCATGA CCTTTGCTTG CACCACAATA AACTGCGATT CGTTTCATGT 360 

35 TAATCCAGCT CCTTAATTCG ATGAATGACT TTTAATAGTG ATTGTTCAAA CACTTTTTGA 420 

TCTTGCTTTG TAAAAGGTGG GGGACCTTTG TGGCGACCAC CTTGTTTTCT AATTTGTGCA 480 

TTCATATATC GTTTATCTAA TAGTTGTTGA ATATTTTTGG AATTGTATAT CTTCCCATTA 540 

TGATGCATGA CAATTAAGAC TTTGTCGACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 600 

ACTACGATAT CATCCTTCGT TGATAATTGA ACAATTTTGT AATCAACTGC ATCTGGTCCA 660 

TCATCAACAT ATAATGTTGA TACATGTGGA GGATATAATT GGTTCGAAAA ATGGCTGAAG 720 

CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGITGTTA AATCTATAAT AGAATCAACA 780 

ACAGGACAAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTCT GTATTGTTTT 84 0 

CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATATTTTTTA AATGACTTGA 900 

TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 960 
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ATTTAATTTG TTATACCAGT ATTTTACGCT TTTTCGTCTA CATATACAAA TTTATATTAA 
ATAAAGCCCA ATACAATTTA GGTTAATTAA AC7VAGTTGAT AACTATTTAA TTATTCCTTC 
ATTGAAGAAT ATAAACTATT AAATCATTAT TTTGCTCTTA CATATATTTT AATGACCTAA 
CTGaTTATGT TCCATGGAAT ACATTTATAA TATAGCCTCC TAATTAArAT GCyTTGTCTT 
GGTCATTCTA CGTAAATTCT ATAAAATATG TTATCTACTT ACATAArAm CTGrACTTCA 
ATACCACCAT ATGTTTGTGA TACTGAAGTT CAGTTTAGTT TTATTTTCAA TTAGAAAAAT 
AAGTTAAGTA TATAGAATAG TAAACCTGCT AACAATGCTG AAATAGGTAA TGTAATCACC 

CATGTAATGA TCATTCGTTG CGCAgTGCTC CATTTTACAC CTTTAGCTCG GTTAGAAGCA I860 

CCAACACCTA AGATTGATGA TGACACAACG TGAGTTGTTG ATAATGGGAA ATGTAGCGAT 1920 

GATGCAACAA AAATTGTTAA TGCAGATGAT AAATCGGCCG CAGCACCATT TGCTGGACGT 198 0 

KrrrTQKTPA TATTACCACC TACAGmTG ATAATTTTCC AGCCACCAAT TGCAGTACCA 2 04 0 
AGCCCCATTG cTGTCGCACA GGCAAATTTT ACCCATAACT GTGGTTCAAC ACTGCCATC7V 
TTCTGTACAT TAGCGACAAT CAATGCCAAC GTAATAATAC CCATTGATTT TTGCGCATCA 

25 TTCGTACCGT GAGAGAATGA TTGTAACGCT GCTGTGAAAA TTTGGAAAAA TCTAAAGTTA 222 0 

CGATTCGCTC TTGTTAAATT TGCATTTTTA AAGATAACTT TAAAAATTGA ATACATCAAG 228 0 

AAACCAACAC AAAATGCGAT AATCGGTGAA ACGATTAATA CAATAATAAT TTTTGTGAAA 234 0 

CCTTGGTAAT GTAACACTCC ATSlATGAGCCT TCAGATGCGA TTGCTGCACC CGCAATTGAA 24 00 

CCTATAAGTG CATGTGAAGA CGAACTTGGA ATTCCGTAAA ACCAAGTAGC TAAATTCCAA 24 6 0 

ATAATAGCCG CAAGTATTGC AGCTAACACA ACAACTAATC CATTTTCCAA TTTAAATGGA 2520 

TCGACAATGT CTTTAGTAAT GGTGCCTGCA ACGCCCXSTAA ATGTTAAAGC ACCTATAAAG 2580 

TTCM'CACTG CTGCCATTAA AATTGCCGTT TTAGGgTTAA CGCTCTAGTA GATACAGCAG 264 0 

TAGCTACTGC ATTGGcTGTA TCATGGAaTC CcATTGATAA AGTCAAATAT CAGCGAGAAA 2700 

ATAACTACAG CTATAGTGAC GATGATTATA TATGACATAA ATATATACTC CCCTTAGCTA 2760 

TTTTTCATAA TAATAGTTTC AAAATTATTT GCTACGATTT GACATTTATC AGCGATTTCT 2820 

TCCATGCTTT CATAAATATC TTTTATTTTA ATTAAAGTGA TTGGATCTGT TTCGCTATTG 2880 

AAAATATGTT TAATTGACTG TCTTAAAATA CCATCACAGT TTGTTTCAAA TTCTTTAATA 2 94 0 

TTAATTGAAT GAATACGCAT ATGTGATAAT TTTTTATCGA CTAATAAGCC GACAGCAAGT 300 0 

so TTCATTTCTG CAACTGCTTT TTGAATGTTA TCAACAAACT CAGCCATATA TTCATCTGTG 306 0 

TATTCGATTG AATACATTTC AAACATrGCT GCCGTTTCTT CAATTGCATC TAAAACATCA 3120 
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(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



1277 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA GAAATAACCT 6 0 

TTAACAAGTA TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT TTATAATTAG 120 

TTAAAATCAC TAAAACGCTT TTATACACTA TCAAATCAGC ATTTATAAAA ATATGAACCG 180 

ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG CTACTTATTT 24 0 

?0 

TTGATAATAT GGAAATCGTA ATATAAAACn AAAACTTAAT TTACTATATA AATTGTCTTA 300 

ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA AAAATTAGCA 36 0 

^5 TTTGCAATAA CAGCAACATC TGGTGCAGCT GCATTTTTAA CGCATCATGA TGCACAAGCT 420 

TCTACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCTCA AAAATACAAC 4 80 

ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT ATTCCCTGGT 54 0 

30 CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT CAAACACTTC TCCACAAGCT 600 

GGTTCAGCAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT TGCTAGCAGA 660 

TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA TTTAATTATG 720 

CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACACC AACAGCTACA 780 

ACACSTTAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC TGCTGGTCAA 84 0 

TGTACATGGT ACGTATTTGA CCGTCGTGCT CAAGCTGGTA GTCCAATTAG CACATATTGG 900 

TCAGACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTACCAAGT AAACAACACA 960 

CCATCAGTTG GTTCAATTAT GCAAAGCACA CCTGGTCCAT ATGGTCATGT TGCTTATGTT 1020 

GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC ATATGGTCCA 1080 

TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC ATTCATCCAT 1140 

TAATTAAATA AATTGTACTG ATATATACTA GCAATTCACA TCATGTGAGA TTGCTAGTTT 1200 

TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT ATCTCACCCT TCCCTATCAT 1260 

ACATATTTAT ATTTTGTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA CCTATTTGAT 1320 
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TCATAACCAT AACCTTTATT TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 

CCAGGAATGT GTCCCTTTTG TAAATCATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 

^ TCATTGGTTA ACCGTGTCTT CGTATTGCTA TTTAAT 

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 
10 <A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



15 



30 



35 



40 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

ATGCCaCACT TATTGGTGGC AGGATCGACG GGTAGTGGTA AATCTGTTTG TATAAATGGT 60 

ATTATTACAA GTATTTTATT AAATGCTAAG CCGCATGAAG TTAAACTTAT GTTAATCGAT 12 o 

CCGAAAATGG TTGAACTAAA TGTTTATAAC GGAATTCCAC ACTTATTAAT TCCGGTTGTT 180 

ACAAATCCTC ATAAAGCTGC TCAAGCTTTA GAAAAAATTG TAGCTGAGAT GGAAAGACGT 24 0 

25 TATGATTTAT TCCAACATTC ATCAACTAGA AACATTfiAAG GTTATAACGA ATTAATCCGT 3 00 

AAGCAAAATC AAGAATTAGA TGAGAAGCAA CCAGAATTAC CTTATATCGT TGTTATTGTA 3 60 

GATGAGCTTG CAGATTTAAT GATGGTAGCT GGTAAAGAAG TTGAAAATGC GATTCAACGT 4 20 

ATTACACAAA TGGCACGTGC AGCAGGTATA CATTTAATTG TAGCGACACA AAGACCTTCT 4 80 

GTGGATGTAA TTACAGGTAT CATTAAAAAT AATATTCCAT CTAGAATAGC TTTTGCTGTG 54 0 

AGTTCTCAAA CAGATTCAAG AACTATTATT GGTACTGGCG GCGCAGAAAA GTkACTTCGT 6 00 

AAAGGTGACA TGTTATACGT TGGAAATGGT GACTCATCAC AAACACGTAT TCAAGGGGCG 660 

TTTTTAAGTG ACCAAGAGGT GCAAGATGTT GTAAATTATG TAGTAGAACA ACAACAGGCA 720 

AATTATGTAA AAGAAATGGA ACCAGATGCA CCAGTGGATA AATCGGAAAT GAAAAGTGAA 780 

GATGCTTTAT ATGaTGAAGC GTATTTGTTT GTTGTTGaAC AACAAAAGGC aAGTACATCA 84 0 

TTGTTACAAC GCCAATTTaG AATTGGtTAT AATAGAGCAT CTAGGTTGAT GGATGATTTA 900 

GAACGCAATC AGGTAATCGG TCCACAAAAA GGAAGCAAGC CTAGACAAGT TTTAATAGAT 960 

CTTAATAATG ACGAGGTGTA AAAAAATGTC AGAAATGAAT GCGGTATATA ACGTTAAACA 1020 
ATaCATTTtA AATTTgATTA AGCAAAATAA ATTGGAATAT GGTGACCAAC TTCCAAGTAA 
5^ TTTATCAATT GCCAGAGAAT TAAATGTAAA AACCGACGAT GTTTATGAAG CAATTCAGcA 
TTGATTACTG AACAAGTCAT TAAAGATAtT TTTGAAGAGGG CACAAGTGTT AAGTCACTGC 
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TGAGTTAAAA GAAACATACG ATACATTTAA AAATGGCATG TTAGAGCATA TGCAAAAAGA 
AGACGATGTC GATTTTCCAA AACTCATTAA ATATGAGCAA GGTGAGGTAG TAGACGATAT 
TAATACTGTG ATAGATGATT TAGTTTCAGA CCACATTGCA ACGGGAGAAT TGTTAGTAAA 
AATCAGCGAA TTAACATCTA GTTATGAACC TCCGATAGAA GCGTGTGGTA CTTGGCGACT 
TCTTTATCAG AGATTAAAAG CACTTGAAGT GTTAACACAT GAACACGTAC ATTTAGAGAA 
TCACGTATTA TTTAAAAAAG TATCATAAAT AACGCGATTA GAAACTGTTG GCAAAAATAA 
GTCCAGCAGT TTTTCGCTAT GTATAAAAGT CATAATAGTG ACATAAACAG CATTATTTGA 
AAAGAAnAAT GGTCAACTTA GCATAAAAAT TGATATGAAn ATTTAATGGT ATAGATAATT 
A 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
ACCGTCGAAT ATCGCTTGTG ATTTACAATT TGTGTATTAA GATGCTCAAC TAATTTGGGT 
30 ACATATTCCG AATTTAGATT TGCAAGTACA ACAATTCCAT AATTTTGTTT TGGATTTAGT 
AAAATAAATG ATGAAAAGTT ATCTAGCGTT CCTGAATGAA ACACTAAATG TTCATCATTA 
TTGGTAAACC AGCCGGAAGC ATATGCATTC GCATTAGGTT CACCAATTGT TGAAGATAAA 
35 TTTTTATGTG ATTGTTGAAC TAATGATTTG TATTTATCAG GTGGATTAAG TTGGAATTTT 

ATCCaATGTT CCAAATCTTC AGTTGATGTC ATCATATATG CTGATGGTGT ATCCCAAAGG 
TTA^TTCAG GTTTAGAGAC GACAGGTGTC GAACCTTGTA ATTCATAGCC AATAGCATCA 
TGTrrrGATT TGTAATTOGT TTGTTTGAAT GATGTATGTG TCATATGCAA AGGCTTGAGC 
CATGAATTTG TAATATATTT TGTATAGGAT TGCTTCGTAA CGTTTTGGAT AATTAAACCT 
AATAAATCAT AGTTCATATT TXSAGTATTCA AATTCTTCTC CGGGCTTATG ATGTAATTCA 
TCACCCATAA TTGCATGGGT TACATCATTT AAACGATTAT TTTTGCTTGT CACAGAATCT 
tCGCTTGTAA TAtCACTAGG TATACCACTT GTTTGAGCCA AAAGTTGCTT AATCGTAATA 
GTTTCATrrT GACCATTATA GTTCATTTrA AAATGAGGCA CATCTTTCGA TACGGCATCA 
TTTAAGTTTA ATCGACCTTC TTGAGCTAAT TTTAAAATTG CAAGACCTGT GAAAGCTTTC 
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CACCTGAAGT TACTTTGTTA AATTCAAGAA TTATCTATAC TTTAAGCCCG ACTAAAATCT 
ATAGAACAGC ATTACTAAAA GATAATGACA TTTATTTTCC AGAAGAATTA AAGAGTGCAG 
5 AAGATCAATT ATTTACAATG AAAGCATATT TAAATCCAAA TCGAATCAGT GTGTTAAGTG 

ATAAAGCGTA TTATTATGCT ACAAAGCGTG AAGGTCAACA TATGAGTAGT GCGTATGTIT 
CACCTGAAGA CTTTTATGAA GTCATGAGAT TGATTGCTGT AGAAATATTA AATGCAGATT 
TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTriT 
CTCGTACGAA TGGCTTCTCA CTrAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 
CTCTAGGAGA CTTTATACAA GCAGTTCCAG AACGTGTAGA TGCATTOGTG ATCAGTAAAT 
TACGACCATT GTTGCACTAC GCGAGAGCGA AAGATATAGA CAACTATAGA ACTGTGGAAG 
AAAGTTACCG TCAAGGTCAA TACTACCGTT TTCATATTGT AGATCGTAAA TTAAACATTC 
AATTCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCCA AAAGTGAAAA 
TGACAGCATT TAAATTTGAT AATCATAAAA TTGTTACAGA GCTAACGTTA AATGAATTTA 
TGATTGGCXSA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACGAAAC AAGAAGCACA 
25 CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 

AAQATATTAA AGCGTATTTA CCTAAAGAAA AAATTTGGGA TGTTTTCTTA GAAGTCCAAA 
TAGGTACGGA AGTATTTGAA GTGCGTGTTG GTAATCAACG TAATAAATAT GCATATACTG 
CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAGATTAACA CCGTATTTCA 
CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 
CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 
TTGAAGAAGG TATGGCTAGT GTCGTACTAA AAGACGACAT GGTGATGGGA ATGTTAAGCC 
AAACSTCAGA AAACGAAGTG CnAAATCTTA CTTAGCAAAG ATATTAAAAA GCGAGACTTC 
AAAAATATTG TTAAGTTAAA CACTGCACAT ATCACTTATC CACTAAATAA ATAATAAATC 
CCCTCAAATC ATTGTGAGCC AACATGATTT GAGGGCTTTA TTTTGCTGTT TATGACATCA 
TTATGACATT TCCCTGATTT TCATTTTCAT ATACATTAAA TTGTATACAC TGGAAATCAG 
GAGGTTATCT ATAATGATAA ATAAAAATGA CATAGTAGCA GATGTAGTAA CTGATTATCC 
GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTGGCG GACAAGTAAG 
TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTG AACGAATTAT TACAGCGTCT 
50 CAACGACGTT GAACAAACGA ATACACCAGG TTCGTTAAAT CCTAAATTTT TAAATGTTTC 
ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT CTAAGAGAAG AATTTAAAAA 
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AAAAAAATCT 


TCGAACATTC 


AGGACATGCA 


CCGCATATTG 


AAGAACCAGA 


AGCATTTATG 


360 


AATTATTATT 


TAAAATTTTT 


AAAAAGCGTA 


TCATAATATG 


TGATATATAA 


ACCTAGGGCA 


420 


TAAAGTCCTT 


AGGCAATGTG 


AAAAAGCTGA 


TTACTATTCA 


TTATTTGATA 


GAAATCAGCT 


480 


'ri-rrrTGAAA 


TGTATTTGAT 


ATATACTGCT 


CGTTATGCX5G 


CTATCTTCCT 


TATATTAAGT 


540 


GCCATTAGTG 


CAAAACCTCT 


TAACAATTAG 


GTAAAAAGAG 


CATAAAAAAA 


GGAAGTTTAA 


600 


TAGAATGTAT 


CATCTATCAA 


ACTTCACCAA 


ATTGCGCTAA 


ACAAAATTAT 


AGTTCAATTT 


660 


CX3TTGTTTGC 


TTCAGTGATT 


CGTTTATTTA 


CTCGACTCAA 


TTVATGATTCG 


ATTTTTTTAC 


720 


GTTGTTGTGC 


ATTAACAAGA 


ATTAATACAG 


TTCTTTCATC 


ATGCTCATTA 


CGTTTTTTAT 


780 


CGAAGTAATC 


TTCTTGAGAT 


AAAATTTTAA 


CTGCTTTAAC 


AACTTGTGGT 


TGTTTGTAGT 


840 


TTAAATGATT 


AATAATATCt 


TTAAGATAGT 


ATTCTTTCTC 


TTTGTTTTCG 


CTGATGTATG 


900 


TCAATACAGC 


GAATTCTTCA 


AAGCTAATTG 


AnAATTCCTT 


TTTAATTAAA 


CTTTTTAATT 


960 


TGTCAGCATA 


AGTGACCATT 


GATAACAACT 


CAAAGCAATC 


ATTGATTTTT 


GTAATTGCCA 


1020 


TGTTTAAAAC 


CTCCCTATTT 


GATGCATCTT 


GCTCGATACA 


TTTGCCCCGA 


TAATATAnTG 


1080 


TATCTAATCT 


TTATGnAT 
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(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2881 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 
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50 



CCAGGTAAAA 


TTGTGCAATC 


ATTTGACGCA 


TTAATGGACG 


CCTTGGACAA 


TGAAGATTAT 


60 


GAAGGAGAAA 


AAGTCATTCC 


ATTCTTAGAT 


AAACATTTTA 


AATATCAAGA 


TGGCCGATCA 


120 


AGTGAGCGTT 


TAGTCAGAAA 


TTTATTTGGT 


AGCTAAGTTT 


ATATAGTAGT 


CAAAGTGGGA 


180 


GAGGTATAAT 


GATGAAATTT 


TCAGTAATAG 


TTCCAACATr 


CAATTCAGAA 


AAGTATATAA 


240 


CAGAATTACT 


TAATAGCCTT 


GCGAAACAAG 


ATTTTCCGAA 


AACTGAATTT 


GAAGTGGTTG 


300 


TAGTTGATGA 


CTGTTCAACA 


GATCAAACGT 


TACAAATAGT 


TGAAAAGTAT 


CGCAATAAAT 


360 


TGAACTTGAA 


AGTAAGTCAA 


CTCGAAACAA 


ATTCTGGTGG 


TCCAGGTAAA 


CCTAGAAATG 


420 


kGGCGTTAAA 


ACAAGCAGAA 


GGTGAATTTG 


TATTATTTGT 


GGACTCCGAT 


GACTATATAA 


480 


ACAAAGAGAC 


TTTAAAGGAT 


GCAGCAGCAT 


TTATTGATGA 


ACATCACTCA 


GATGTCTTAT 


540 
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TTGTCATGAA ATAAATGGGA TGAATATCAC GACTAGAAGT AATGTTACGA ACAGGAGCGT 120 

ATAAACTAGA GACGCTAAAT TCGACATAGT ATGTnGCTCA ATTATGGCTG ATGATGAATT leo 

TAAAGTATGT GCGTTGGAAC TGTCGGATTT TTGTTCATAA TGTTTTGCAT ATTGCGCCAT 24 0 

GATGAATAGT GTAAAAATAA ATAAAACAAT AAGAGATATA ATGCCCATAA TCAAAAGTAT 300 

TTGTTTAGAG CCTTTCATTA TTTCACATCC TTTCTAAAAT ATATTTGTAA CTAAATTTAA 3 60 

AATAGTTATT TTTGTAATTC TAAACCTTTT TCATCGCGAA AACAATTAAA. TAGGTCGCGG 42 0 

TATTAATTAT TATATTATTA CCGCTTAATA TGAAAAATAC ATGAaAATTA ATTTTCTAAT 480 

ATACTTTTGA AAAATTATTA CAAATTAGCC CCTTCAAAAC GCGAAAACAT AAGGATTCTA 54 0 

GTTTCAAAAG GGCTGATAAG CATAAAATGA AATGTAATAT TTCX3ATGTAT AAAATTTTTA 600 

ATTAGCTAAA AATCATCGCA TTAATTTTTT GAGCTACATC ATCAAAATTC GGACATTTTA 660 

20 ACGACACATA TAATTTAATT TTAGGTTCAG TACCAGAAGG ACGTAAAGCG ATAAATCCTT 720 

CX3TCAAATAA GACACGAATA ACATTTGATT TAGGAGAATT AATCTGCGAC GTTGTATCTT 780 

TATCCAAATG ATAAACCTCG CTAGTTAAAT AATCTTCAAT TGCTTTCACT TTGAGTCCTT 84 0 

2S GAATCTCTTG CGGTGGATTT GAACGGAATT TGGTCATTAT TGCATTAATT TTCTTTTTCC 900 

CrrCAAATCC TTCTAGCGTA TGCGgAATAA TGTATCCTCA TGTCTACCAA CAGTTTGATA 960 

AATCTGTTCT AATTCATCTT TCAATGTTTT GCCATATAAT TTTAACTCAG AAGCGTATTT 1020 

TATAATGAGT GGCACAATTT GTACGGCATC TTTATCACGT ACAAAAGGCT CTGATAGAAA 1080 

ACCGTAACTC TCTTCAAATG CGAAAATCAT ATTTGATGAT CATCCAGTTG TCTTATTTCC 114 0 

TGAGCAATAA ATTTAAGCCC GTCAGCACCT CTTTGGTATT CAACATTATT ATA 1193 
(2> INFORMATION FOR SEQ ID NO: 253: 
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ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



45 ^^i> SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG TATATnCCTT TTATTCTCCT 60 

AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT CCAAAAACCA AnGAATATAT 120 

nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT CATTAATTTT GCATAAAGAT 180 

GAACCTATGT ATTTAAAAAA ACGTACATGT GTACCTACTT TGTTAATTAA TGGGGAACAT 24 0 
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(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 





CCAGGGTGCG 


GAAAGCTTTA 


AAATTTGGAC 


TAATAAAGAT 


GCTGATATTA 


ATTCTATGAA 


60 


15 


AACAGCAGTT 


TTACAACAAT 


TAAAAGGAGA 


ATAACATATG 


CTTACTGGcA 


AACAAAAAAG 


120 


ATACTTAAGA 


AGTTTAGCAC 


ACAATATTGA 


TCCGATTTTT 


CAAATTGGAA 


AAGGCGGTAT 


180 




CAACGAAAAT 


ATGATTAAAC 


AAATAGATGA 


TACGTTAGAA 


AACAGAGAAT 


TGATTAAAGT 


240 


20 


ACATGTACTA 


CAAAATAACT 


TTGATGATAA 


AAAAGAATTA 


GCTGAAACAT 


TAAGCGAAGC 


300 




TACTCATAGT 


GAATTAGTGC 


AAGTGATTGG 


ATCTATGATA 


GTGATTTATA 


GAGAATCTAA 


360 




AGATAATAAA 


GAAATTGAAT 


TGCCATAATA 


ATGAAAAAGA TAryACTTTA 


CGGCGGTCAG 


420 


25 


TTTAACCCTA 


TCCATACTGC 


ACATATGATA 


GTAGCTAGOG 


AAGTATTTCA 


TGAATTACAG 


480 




CCAGATGAAT 


TTTAlTT'l-rr 


ACCTAGTTTT 


ATGTCTCCAT 


TGAAAAAGCA 


CCATGATTTT 


540 




ATAGACGTTC 


AGCACAGATT 


AACAATGATA 


CAGATGATTA 


TCGACGAGCT 


TGGTTTTGGA 


600 


30 


GATATTTGTG 


ACGATGAAAT 


TAAACGTGGT 


GGTCAAAGTT 


ATACCTATGA 


CACGATCAAG 


660 




GCATTCAAGG 


AGCAACACAA 


AGACAGTGAG 


TTGTACTTTG 


TTATTGGGAC 


GGATCAGTAT 


720 




AACCAACTAG 


AGAAATGGTA 


TCAAATTGAA 


TACTTAAAAG 


AAATGGTTAC 


TTTTGTAGTT 


780 


35 


GTAAATCGAG 


ACAAAAATAG 


TCAAAATGTT 


GAAAATGCTA 


TGATTGCAAT 


TCAGATACCT 


840 




AGGOTAGATA 


TAAGTTCGAC 


AATGATTCGA 


CAAAGAGTTA 


GTGAAGGGAA 


ATCTATCCAA 


900 


40 


GTTCTTGTTC 


CTAAATCCGT 


TGAAAACTAT 


ATTAAGGGGG 


AAGGATTATA 


TGAACATTGA 


960 


AAAA 












964 




(2) INFORMATION FOR SEQ ID NO: 252: 









(i) SEQUENCE CHARACTERISTICS: 
''^ (A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
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TGCTGTATCA AAACTATTAA ACAAGAATGT GTTTAGTAAA CTACCCX3TAG AGCTAGTTAA 4 860 

CATATTTAAA ATAAGAACAG TTGGTCCAAC AATTAAAGCA GCTACCATTA AAATAGTACC 4 92 0 

TAAACCAATG TTCAAGTTAC TTAAGTATTG AATACCTTTA CTTAATCCAG ACCATGCACT 4 980 

TGCTATAAAT AAGATAGTAA CAACAATGAT GATAATCGCT TGTACAAACG TATTGTTTGG 504 0 

AACATTGAAC AAGTAATGTA AACCACCATT AATTTGTAGA GCACCCATAC CTAACGAAAC SiQO 

GGCTACCCCA ACGATTGTCG CAAATACAGA TAAAACGTCA ATAAAAATCC CAATAGGACC 516 0 

TTCTACTTTA TCACCTAAAA GAGGACGTAA AGTTCTAGAT AATAAACCTG GTTCACCTTT 5220 

ACGGAATTGC GAATATGCCA ACGTAACGCA ACAACACCAT AAACAGCCCA AGCATGGAAT 528 0 

CCCCAATGGA AAAATGTTGA ACGTAQAGCT TCAGTATAAG CTTCAGTAGT TTTGGGATCT 534 0 

GCTGTAGGTG GCGTAGCAAA GTGCGCCATC GGTTCAGCTG CACCATAAAA CACCAAACCT 54 0 0 

ATCCCCATAC CAGCACTAAA CAACATAGCA AACCATGAAA TTGTATTAAA CTCAGGTTTG 54 6 0 

TCATTTGGTT TACCTAGTTT AAGTTTTCCA ATAGGACTAA AAATAAGGAA TATACAGAAG 5520 

AACACGATAA TCGTAGTAAG AATAAGATAA TACCAACCTA ACTTTTCTGT AATCCACATT 5580 

25 TTAATATTAT TGGTAACATA GTTGAATTGT TCAGGTAAAA ATGCACCAAG TAATACGACT 5640 

ATAGCAACAA CAATTGCACT ATAGATGAAG ACTGGTGAAT ACTTCTTTCC ATTTGGATTC 5700 

TCTGGTGAAG AAGAATTCAT AATTAATTAC TCCCTTCAAT TCTATATTTA ATTTTATGTA 5760 

30 GTAGAATAAA AATATTATCT AAACATTTTA TTCAATAACT CACG 5804 
(2) INFORMATION FOR SEQ ID NO: 250: 
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35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doioble 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

ACCCGCGAAT ATGGTCCATC CTATCGATTT ATTTTTAACT GGTTTGACAA TATTTAATTT 60 

TTCATAATCA TTCTTAGTGA TTTTGACATA TGTTTTCGGT ATGAGCCAGT TAATAAATGG 120 

AAAGAAGAAG ACAATCCAAT TACTTGCCAA ATCaATCATT AAATATTCAC TATCGTATTT 18 0 

GATTATTCGA TATTTAGGGT TTTTATTAAT AACTTTAGAT TCGCAAAGCA ATGTCTCCAC 24 0 

5^ ATCCCTTTAA TTTTATGTGT AATACATTTT TCGATACTTC AAAAGACATT CAAATACTAT 300 

CAAGTTACTG TCATCAAAGG TTTTATTAAC TGATATTtTC ATATTTTTAa TCTGAATTTA 36 0 
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GTCTGTAATT ACAAAATCAT CTTCTTGACG AAGTAAAGAT TCTAACAAAA CACGAATTGA 3060 

ATAAGGTAAA TTGGAAACTT TAGTAATACC TTGCTCTTCT ACAGCTTTTA AATCATAGTA 3120 

AGTATAACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTGCAGC 3180 

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA ATTGTATTAT 3240 

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3 300 

TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTGTAATAAG TATTTTTTAT 3 3 60 

TAATTGAGAA TAATTATCAA TTTCGCGAAT GATTCAATTC AATTTTTAAA CGTATTATTT 3420 

CATTGAGCAG AAAGAAAATT ATGGCACCT^ ACTTTAATAT TTTTTTCAAT GTCATTCTTT 34 80 

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 354 0 

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3600 

GCACATTATT GTAAGCTGAC TTTTCGTCAG CTTCTGTGTT GGGGCCCTCA CCCCAACTCG 3660 

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTGAC TAGAATTGAA 3720 

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTTCGTAGAG 3780 

25 CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA TATTTAATTT 3 840 

GCGTTTATAC CGTCAAACTT CACTTTAGCT TTGTCAAACC CCTTTCTATT AAGTTTTCAG 3 900 

AAATAAACCT ATCTTAAAAT ATAAAAAAAT CGAGAATTCG TAGTTTAATA ACGAAATTCT 3 960 

30 CGTTCTTATC CTTTTGAATA TACTCAATTT TCCACAAAAA CAAACAAGTA GTATATCTGT 4 020 

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4 080 

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AGAATGTCAG ATTCATAATC 4140 

TTCTTGTTGA CTCTTGATAT ATTCTTGTAA GCGATGTTTA TTCGGAGTCA ATGTTAAACC 4200 

TAG<»ATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT ACCATCATGA GTATGACGAC 4260 

AAAGSgAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4320 

GCCGTTACCG CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGCCTGTG AAATTCCCCA 43 80 

AACAACTTTT ACCATACTAG ATGGATTTAA TGAACCAAAT GTTGTTTGCA TTCCTAATAC 444 0 

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATGCAATCAA 4500 

CGATAAAACA ATGCCAAATG GCACATGATT AAACACTCCA AATAGCTGTG TTTCAGGAGT 456 0 

CATATCAAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG CCTAATACAC CAAAGACACT 4 620 

AAACCAAACA AAACTAACAA TTGCTGGAAC TAGCAAGACA CCAGAAATGA ACTCTCTAAT 4 680 

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4740 
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TTCTTTAGAT AATCCTTGGA AGAATGATGG ATTTTGAATG TATGTTGAAT TAGGATCAAA 1260 

GTCATATAGA GGTTGATCAG TTACATCAAT CTCATTCCAT AATTCGTTCT TATTGTATAC 1320 

GTTATTATAT TCTTCAATAA ATAATTCAGG TGTTACAACA CTATCAACGG TATCTGAAAC 1380 

TTCTTTAATT GATGGCCAAA TATCTTTCAA ATATACATCT TCACCGTCAT TACCTTTACC 1440 

AATAGGTTCA TTTTGTAAAT CAATATCAAC CGTTCCAGCT AATGCATAAG CAACAACTAA 1500 

CTGTGGTGAA GCTAGGTAAT TGGCTTTAAC AAGAGGATGG ATACGACCTT CAAAGTTACG 1560 

GTTACCAGAT AATACAGATG TCACTAATAG GTCCTCATCA GCAATCGCTT TTTCAATTTC 1620 

TGGTAATAAA GGACCTGAAT TACCGATACA AGTTGTACAT CCATAACCAA CCAAGTTGAA 1680 

GCCTAAATCA TCTAAATAAG GTTGTAAGCC AGCATCTCTT AAATATCCGG TAACAACTTT 174 0 
TGATCCTGGT GCTAGAGAAG TTTTAACGTA TTCAGGAACT TTCAAGCCTT TTTCAACTGC 
20 TTTTTTAGCA ACTAAACCTG cACCTAACAT TACATAAGGG TTAGATGTAT TTGTACATGA 

TGTAATTGCT GCTATTGCAA TATCACCTGT TTTCATTGTA GCTTTTGATC CATCTTTAAA 1920 

GTTAATTTCA GCTTTCTTAT CAAATTCACT TTTATCTAAA CCGTGTCCTT GGTTGCCTGC 1980 

TGGAGCTGTT ACAGAATTTT CAAATGATGA TTTCATATCA CTTAAGAAAA TTAAATCTTG 2040 

AGGACGTTTT GGTCCTGAAA GCGATGCTTC AACTGTTGAT AAATCCAATT CGATAACATC 2100 

TGTATAATTA GGATCTTCTT TCTCAACATC AAAGAACATA TGGTTTTGTT TCAAATATTC 2160 

TTTTACTAGC GCGATATGTT CGTCTGATCT ACCAGTTAAC TTCATATATT TAAGAGATTC 2220 

ATCATCAACT GGGAAGAATC CGCAAGTTGC TCCATACTCT GGTGCCATGT TTGCAATTGT 2280 

AGCACGGTCT GCTAGTGGTA AATGTTGTAC ACCTGGACCA AAGAACTCCA CAAATTTACC 234 0 

AACAACACCT TTTTTACGTA GCTCTTGAGT TACTCTTAAC GCTAAATCAG TTGCTGTTGC 2400 

GCCTTGTGGT AATGAATTTA CTAGTCGTAC ACCAATAACC TCTGGAATTG GGAAATAAGA 24 60 

AGGTTGTCCA AGCATTCCAG CTTCAGCTTC T^TACCACCA ACACCCCATC CTAGTACGCC 2520 

AATACCATTT ATCATTGTTG TATGTGAATC AGTACCAACT AATGTATCTG GAAATGCAGT 2580 

TTTTTCACCA TCTACATCAC GAACATGTAC AACACTTGCT AAATATTCTA AGTTAACTTG 264 0 

^5 GTGAACTATT CCAGTTGCAG GAGGAACTGC ATTGTAATTA TCAAATGCTT TCGTTGCCCA 2700 

ATTTAAAAAC TGATAACGTT CATAGTTACG TTCAAATTCT AATTTCATAT TACGTTCAAG 2760 

AGCTTCTGGA TTTGCATAGC TATCCACTTG AACTGAGTGG TCAATAACTA AATCCACCGG 2820 

SO TACTTCTGGA TTAATTTTAG TAATATCTCC CCCAACGTCA TCCATTGCTT TACGTAAAGA 28 80 

AGCTAAATCA ACTACGGCTG GTACACCTGT GAAATCTTGT AAAATAAC7VC GAGAAGGTTT 294 0 

55 



3NSOOCID: <EP 07865 19A2_I_> 



1042 



EP0 786 519 A2 



10 



IS 



20 



35 



40 



45 



TTAGATAATG GCCAAAGACA TGAGCATGTA AATTTAAAAG ACX3TGCCGAA ATCAATGAAA 720 

GACGCAGTAC TTGCAACTGA AGACAATCGT TTCTACGAAC ATGGCGCACT TGATTATAAA 780 

CGTTTATTCG GTGCAATTGG TAAGAACTTG ACTGGTGGAT TTGGkTCtGA AGGtGCCTCA 84 0 

ACATTAACAC AA 8^2 
(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQXTENCE CHARACTERISTICS: 

(A) LENGTH: 5B04 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CACTTTTTTC ATTAAAAATC TCATATTTAT ACACTGAACG TAATCTCGAA TATTTTTCAA 60 

CCCAAGTTTT AACTTTAACT TTTTCTGGAT AAAAAATAGA CTTTTTATAA TTGACATTGA 12 0 

GGTCAGTCAC AGGTGAAATG ATTCCTTGTT TTTCCATATC AGCATAACTA AAACCTAACT 180 

25 TCGATATATA ATCCAACCGC GCAACTTCAA ACCAAGTTGC ATAATTCCCG TGATAAATTA 24 0 

CACCCATCTT ATCAGTTTCA GCATAACGCG CTTCTATTTC TGTAATACTA TATATCATTT 300 

TAAGCCTTCT TTCAGTTTAA CTTTATATCT CATTCTAACA TAAAATACAA GAAGAGGCCG 360 

GCCAAGAACA CAAAGGkTTT GAACCGACCT ATTATATCAT AAaGTTTATA GAAGTATTTT 42 0 

TGAGCACTAT CAAAGTGCCT CAAATACCGA TTAAAATTTT ACTGTGATAT CTATTTTTTA 480 

TTGCGCTAAT TTATTTCTTA AAACCATTTG TAAAATTCCA CCGTGACGAT AGTAATCCAT 54 0 

TTCAACAAGT GAGTCAAAAC GAACCATAGC GTCAAATTCT ACCAAATCAC CATCTTGCTT 600 

CTTAjSCAGTA ACTTTGACGT AGtCATGTGG TTGAACATTT TCATCAATAT TAACAGrAAT 660 

TTCTTCTGTA CCATCTAGAC CAAGAGAATC AGCTGATTCA CCTTTTTTAA ACTCTAATGG 720 

TAATACACCC ATCATAACTA AATTTGAACG ATGGATACGT TCATAACTTT GTGCAATAAC 780 

TGTTTTAACA CCTAATAAGT TTGTACCTTT TGCTGCCCAG TCACGAGATG AACCCATACC 84 0 

ATAATCGTTA CCAGCTAATA CAACTAAACC TGTACCATCT TCTTTATATT TCATTGCAGC 900 

ATCAAAGATA GGCATTACTT CATTTGTTGG CCAATAAGTT GTAAAACCAC CTTCAGTACC 96 0 

TGGCGCTAAT TGGTTTTTAA TACGTATATT AGCAAACGTA CCTCGAACCA TTACTTCGTG 1020 

50 ATTACCACGT CTTGAACCAT ATGAATTAAA TTCACGAATA GGCACTTGAT GATCTTGTAA 1080 

ATATTTACCA GCTGGCGTAT CTTTACCAAT TGCACCTGCT GGAGAGATGT GGTCAGTTGT 114 0 
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AV- 1 Av:»VjTATA 


CATGATACTA 


CTAAATACAT 


TAAAGAATTA 


TCCGGCGGAC 


AACAAAAACG 


360 






GCTAAAACAT 


TAATAGAACA 


ACCAGATTTA 


TTGTTATTAG 


ATGAACCTAC 


420 


5 




GACTTCGAAT 


CAATCAGCTG 


GTTGATCAAT 


TATGTGAAGC 


AATATCCTCA 


480 






TTCGTAACCC 


ATGATCGATA 


TTT'l-lTAAAT 


GAAGTTTCCA 


CTAGAATTAT 


54 0 


10 


IvjAfltL- J. /\AAC_ 


AGAGGTAAGT 


TAGCGTCATA 


TCCTGGTAAC 


TATGAATCTT 


ATATTGAAAT 


600 




VjC Cj wG CTGAA 


AGAGAAGTAA 


CACTTCAAAA 


GCAACAACAA 


AAGCAACGAG 


CTTTATATAA 


660 




GGAAGAACTT 


GCTTGGATGA 


GGGCTGGgAG 


CTAaGGCTCG 


TACTACAAAG 


CAACAAGCTA 


720 


15 


tjAATTAATCG 


ATTTAATGAC 


CTAG/UnAATG 


AAGTTaACCA 


GCAATATAAA 


GACGATAAAG 


780 




GTGAATTGAA 


TCTTGCTTAT 


TCaAGATTAG 


GTAAGCAAGT 


GTTCGAATTA 


GAAGACTTAT 


840 




wiAAviijC 1 AT 


TAATGATAAA 


GTATTATTTG 


AACATCTGAC 


GGAAATTATT 


CAAAAmGGTG 


900 


20 


AGCGTATTGG 


TGTTGTTGGG 


CCAAATGGAG 


CTGGTAAAAC 


AACACTCTTA 


AATATTTTGA 


960 




GTGGAGAAGA 


CCAACAATTC 


GAAGGTAAAT 


TGAAGACTGG 


GCAGACGGTT 


AAAGTAGCTT 


1020 




ATTTTAAGCA 


AA 










1032 


25 


(2) INFORMATION FOR SEQ ID NO: 248: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



35 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


248 : 






TGTGATTAAC 


GAAGCTTATT 


TTCGTACACC 


TTCAACAACT 


GATTACAACG 


GCGTTTATCA 


60 




AGGTPATTAT 


ATTGATTTTG 


AAGCAAAGGA 


AACTAAAAAC 


AAGACGTCCT 


TTCCTTTAAA 


120 


40 


TAATATTCAT 


GACCATCAAG 


TCGAACATAT 


GAAAAATGCA 


TATCAACAAA 


AAGGTATTGT 


180 




GTTTTTAATG 


ATTCGTTTTA 


AAACGCTAGA 


TGAAGTTTAT 


CTAUTACCCT 


ATTCAAAATT 


240 




CGAAGTATTT 


TGGAAGAGAT 


ATAAAGATAA 


TATTAAAAAG 


TCTATAACAG 


TTGATGAAAT 


300 


45 


ACGAAAAAAT 


GGTTACCATA 


TTCCTTATCA 


GTATCAACCA 


AGATTAGACT 


ATCTAAAAGC 


360 




AGTTGATAAG 


TTGATATTAG 


ATGAAAGTGA 


GGACCGCGTA 


TGACGGAAAA 


CAAAGGATCT 


420 




TCTCAGCCTA 


AGAAAAACGG 


TAATAATGGT 


GGGAAATCCA 


ACTCAA7VAAA 


GAATAGAAAT 


480 


50 


GTGAAGAGAA 


CGATTATTAA 


GATTATTGGC 


TTCATGATTA 


TTGCATT-I'TT 


CGTTGTTCTT 


540 




TTACTAGGTA 


TCTTATTGTT 


TGCTTATTAT 


GCTTGGAAAG 


CACCTGCTTT 


TACCGAAGCT 


600 
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CGTTAATTTT 


TATTAGAATG 


GTAAGTCATC 


ATCACTTATA 


TCAATCGGTC 


CGTTTGCATT 


2340 


TGCAAATGGA 


TTATCAGATT 


GTTTCGTGTT 


TGATGAATTA 


TTGTACGAAT 


TGTTTTGTCC 


2400 


TGATTGTTGA 


CCACCGAATC 


CTTGACCGTA 


ATCTTGGAAT 


TCATTTTGTT 


GACGTTGGCC 


2460 


ACCATTTTGT 


TGCGCATTTT 


TAGGTTCAAG 


GAATTGAACG 


CTATCACACA 


CAACTTCAGT 


2520 


AACAAACACA 


CGACGACCTT 


CTTGATTTTC 


ATAATTACGG 


GATTGTAAGC 


GACCATCTAC 


2580 


ACCAGCTAAA 


CTACCTTTAG 


ATAAATAGTT 


ATTTACATTA 


TCTGCTTGTC 


TTCTAAAAAC 


2640 


AACACAGTTA 


ATAAAATctG 


CCTCGCGCTC 


CCCTTGAGCA 


TTCGTGAACG 


TACGATTTAC 


2700 


TGCAAGAGTG 


AATGCCGCTA 


CACTCACACC 


TGAGGGAGTG 


GTTCTGTATT 


CCGGATCTTT 


2760 


CGTTAAACGA 


CCTACTAATA 


CAACTCTATT 


TAGCATTTAA ACGCCCCCTC 


TAATTATTAC 


2820 


TTGTCTTCGT 


CTTCACGAAT 


AACCATGTAA 


CGAATGATAT 


CGTCACTGAT 


TTTAGCTAGA 


2880 


CGTTGGAATT 


CGTCAGTAGC 


TTTGTTGTTA 


TCAGATTTAA 


CACGTACGAT 


GTTGTAGAAG 


2940 


CCATCTTTGA 


AATCATTGAT 


TTCATAAGCT 


AGGCGACGTT 


TACCCCAGTC 


TTTTGCTTCT 


3000 


AAAACTTCTG 


CACCTTCAGT 


AGCTAAGATA 


CCGTTGAAAC 


GTTCAACTAA 


CGCTTTTTTA 


3060 


GCATCTTCCT 


CAATGTTTGG 


GCGTACGATG 


TACATAACTT 


CATATGTTCT 


CATTTTATAT 


3120 


TTGCACCTCC 


TTGTGGTCTA 


TACGGCTTAT 


CAATCTTAAA 


ACAGATAAGC 


AAGGaATAAT 


3180 


TTTCATTACT 


CACAATAAAG 


AATTaTATCA 


TGCGCCATTA 


CTTTTTACAA 


TaATAATTcA 


3240 


AACTACTCTT 


CATATCATTT 


TTGATATtAA 


TTCATTTGaA 


ACTTTCnATG 


ATATTTTnAA 


3300 


AAATACACTT 


CACAAAAGCG 


AACATATGTn 


CTATAAnAGT 


TGTGAGGTGG 


TAAGGAATGA 


3360 



ATTTA 

(2) INFORMATION FOR SEQ ID NO: 247: 

^i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 247: 

45 

GTTAAAAGTA ATTGGTGGTA TTGATGATGA TTTTACAGCC AATGTTATGC ATCCAAATCA 
ATATCGAATT CGATATTCGT CTCAGAAACA GGACCTTAAT GAAGATATGA CAGTTTTTGA 
50 TGCAGTATTA AGTTCTGATA CAACAACTTT ACGCATCATC AAGCAATATG AGCAGGCAGT 
ACAAGCTTAT GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATGG ATGCGCAAGA 
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TATTCACAAA TACGTCTGGT AGCCCTATCG ACTCGAACAA AATTAGCCAC ATTATTAAAG ' 54 

GGGGGCGCTG ATATTAGTTC TATTAAGAAA CCTATAACGA CGCATACATT ACATCATTCG 60 

CATATATCTA CACTTGCTCA ATTAGGAATT AACTTAAAAG CAATGCAAGA GCATGTAGGT 66 

CATTCAGATT ATAAAArAAA TCTAGAGATA TACACACATG TTACTAATCA GATGGCGAAA 721 

GATATGATGA ATAAATTTGA ACGATTGGGG AGTTAAAATT GGAAAAAGAT GaTACACTAG 781 

CAGAAATTAA GCCTATGCTC AATTTTGATG AGCAAATAGC AAAATTAAAA CAGATGAATA 84 ( 

TATTTTTTAA TATTATTGAC ACCGAAAAAG CAAATGAAAT TCTTAGAAAA AATAATTACT 90( 

TCTTCAAACT wGcTTATTTC CGaAAAAATT TCGrAAAAAA GaATGGCGGC TATTTCATAG 96C 

AATTTGCTTA TTTATCAGAT TTAGCAACTA TAGATATGaA ATTAAGATAC ACAATGTTGC 102C 

ATTTAACTTT AGATATTGAA CATAGTTTAA AGTATCTAGT CTTAAAACTA ATAACAGAAA 108C 

20 ATAACCAAGA AGATGGTTAT AAAATAATAG ATGAGTTCTT ATGTATTGaT AAATCATATA 11 4 c 

GCAATTCAAA TTTTGACACA AATTCAAGAA CACCAGAAGA AGTTATGGAA ACCAAAATCA 120C 

AAAATAAAAA CGAAATATTC AAGCATATGA ATAAACGAGG ACAACTACCC GAGAAGTTGa 1260 

ATAAATACTA TCmAAATCCA CCCGCnnGGk TTTGCaTTGr ATTCATGCAA CTAGGTCAAT 1320 

TCGTTTCGTT TCTCAACTTC TATTACAAGA AGTACAATGA CGAAGAATTG AGAGTTGCTA 138 0 

ATATTTTAAT GCCTTTAGTT AAAAATATAA GAAaCAAATC AGCTCATAAC CAACCCATCA 144 0 

TAGCAAATCT AAATTATGAC AGTAGATTAC CTCAATATTT ATTTGAAAAA GGGAATAATA 1500 

TAGGCATATC TAGAAACATG TTCGGAATAA AAAATTTCAT AGATACTkTC ksTACGCTAG 156 0 

AATTACATAA TCAAGTTTGT AGTAATGCAA TTATCCAAGC AAGATATCAC GATTTGGACC 1620 

AACTTCAAAA GCGATATAAA AGrAACGrAA GCTATTATAA TAATGCATTA GCTATCAAAA 1680 

GATTTTTTAT AGCTTTAGAT AAAATTATTG ACTTCAACAG ACCAAAAGTA TAAACTATCT 1740 

AGTGAGGAAA GAGACTTATA GGTCTCGCGA GTTATTTTAA TTCGTATGCA AGAAAAAGAA 1800 

GAGCTATGCA TTTTATTTAA AATGCGTAGT TCTTtTTTTA TGCATCTAAA TTCATATTAT I860 

TTTTGCAATA TAAACATATC TTTGTGCAAA TTCCGAACAC AAAACATTCA CATCATCCTT 1920 

^5 TTTtGCCCTT TTTCTATACC CCAAAACACA AAAAGCCCCG TAAGCCTATG CcTACGGGgT 1980 

TTGACAATAA ATTATATATT ATTGTTCTTC TTTAACATAT GGTAATAATG CCATATGACG 204 0 

AGAACX3TTTG ATAGCTGTAG TCAaCATACG TTGATATTTA GCTGAAGTAC CAGTTACACG 2100 

50 ACX3TGGTAAA ATTTTACCGC GTTCTGAGAT AAAACGTTTT AATAATTCAG TGTCTTTGTA 2160 

GTCGATATGT GTAATACCAT TTGCTGTGAA ATAGCATACT TTTTTACGAC GACGTCCGCC 2220 
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CACCATTGTT 


AAGTCTGTTT 


AATGACTTAA 


AAATAGATAG 


ACGTTTAATC 


GGTTTGATTA 


480 


TCGGTTTTGG 


TTTATGTTTC 


CCGTATGTGT 


TATTACCATA 


TGGATTCGGT 


CAAATTTTCC 


540 


AGCAAATTAT 


TCAAAGTGGC 


TTTGCAAAGG 


CAAATCACCC 


AATTGAGTTT 


AATATGATTT 


€00 


GGAAAGCAAT 


GCTTATTCCT 


TCAATGGGGT 


ATATTGTTGG 


CTTACTTATC 


GGTTTATATG 


660 


TATATCGTAA 


ACCACGTGAA 


TATGAAACAC 


GTAAAATTTC 


AGATAGTGAC 


AATGTTACAG 


720 


AGTTAAAACC 


ATATATCTTA 


ATAGTAACAA 


TTGTAGCAAT 


ACTAGCTACA 


TTTTTAGTAC 


780 


AAACATTTAC 


AGATTCAATG 


ATTTTTGGTG 


CACTGGCAGG 


GGTACTCGTA 


TTCTTTATTT 


840 


CACGTGCATA 


TAATTGGTAT 


GAATTAGATG 


CTAAGTTTGT 


TGAAGGTATT 


AAAATTATGG 


900 


CTTATATTGG 


TGTAGTTATT 


TTAACAGCAA 


ATGGATTTGC 


TGGTGTAATG 


AATGCTACTG 


960 


GTGATATAGA 


TGAATTAGTT 


AAAACTTTAA 


CAAGTATTAC 


TGGTGATAAT 


AAATTATTTA 


1020 


GCATTATCAT 


GATGTATGTG 


ATAGGTTTAA 


TTGTCACTTT 


AGGTATTGGA 


TCATCATTTG 


1080 


CAACAATTCC 


TATTATCGCA 


TCATTATTCA 


TTCC"n''i'TGG 


AGCGTCAATT 


GGACTAGATA 


1140 


CAATGGCATT 


AATCGCATTG 


ATTGGAACAG 


CGAGTGCATT 


AGGTGACTCA 


GGTTCGCCTG 


1200 


CAAGTGATTC 


AACATTAGGA 


CCAACTGCGG 


GATTAAATGT 


TGATGGCCAm 


CATGATCATA 


1260 


TACGTGATAC 


ATGTGTACCA 


AACTTCTTGT 


TTTATAATAT 


TCCTTTAAAT 


GATTTTCGGT 


1320 


ACTATTGCTG 


CTATGGTACT 










1340 



30 <2) INFORMATION FOR SEQ ID NO: 24 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3365 base pairs 
{B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 



45 



50 



CAAAATCTAA 


GAGAATAAnA 


TTTGTTAATT 


TnAAATAGCA 


AGCAATTCAA 


AGTTATATGT 


60 


GTAATAGATA 


AAATAGATAT 


CCCTATAGTG 


ATGCGTTACT 


AGCTAAACAT 


AATAACACAT 


120 


TAGAAGATAA 


TGAAGTTAAG 


GAGTTACTGG 


ATTGTTTCGA 


CTATGTAATT 


AAGTATAAAA 


180 


ATATCCAACG 


ACAAAACGTA 


ATTATAAAAT 


GGTAAAAGCT 


ATGGTACAGT 


TTCAAATTGC 


240 


TAATGACATG 


CGTATCGGTG 


AGCTACTTGC 


AATAAAGAGA 


GTAAATATAA 


ACTATGAAGA 


300 


TAAAACGCTA 


GATATCGACG 


GTAAAGTTAA 


TTGGATAACT 


6AAAAAAGAC 


GGGAGCATTC 


360 


GGAGTAAAGG 


AGACAACTGA 


AAGAAGTAAT 


AGCTATAAGG 


CCACAGGGCT 


CACTACCCAA 


420 
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TTGGTAAACG TTCGTTGCTG ATATATCTGT AAAATTGTTT GGACCGACAC CTGCAATAAA 
CTTAAACTCT GCTTCATCTA CCAAATAATC ATACGCTTGT GTATGTCTAT CCTGTGCGCC 

5 

ATGTGGAAAT ACAAACATAT CTGTTTTACC TACAATTGGT TCAACTTCAT CTTTCCATCT 

ITTAGTATCA CGTTTAATAC CTTCTAAAGA TGTTITTTCA AAATTAATGT GACCATATGA 

ATGACTCGCA AATGACCATC CATCCCGTTT CATTGCGCGA ACAACTTCCT CAGCTGCCTT 

TTTATTCTTT GTATAATCTT TACTCGTTAA TTCATTCGTG CGATAACCTA ATACGCCCTC 

ATAACCGGTT AAAGCAACAA CACCTTTTTC ACC7VTTTAAA GAAAAATCTG GATGCTCTTT 

TACAAATTTA TTTAAAATTG GCACGATATC ATTGTCATCA GAATAAGTAG CATGGCCTTT 

TTTGTCTGTA GTTTCAGAAA CAACATGTTT ATTTTTATCG AGTACTAAAC GGTCAGCATA 

ACCATGGTGT CTCATGTAAC TATAGTAATT CATATCATCA ATTGAGATGA TTAGTGGCTT 

20 TTTACCTTTC GGCAATTTTA TTTTTTTGGC TTTTACATGA TGAGATGATA AGTCGTATAC 

ATCATGTGGA TTAACGATGA TGTAATTATT TTTATATAAT TCGTTCAATG ATTTTTTAAA 

TTCACTTACA GTAATCATCC AATCATTGTT GCCCTTAGCT TGGTGTGTAT CTCCTGTAAA 

CGCAACTTTT GGGTCTGTAA TTAATGGGTG ATAAAACACA TGATAAACTT GGCCGTGATA 

TGTTTCCCAA TGTTCATCCA TTTTcGATTT aTGCTTTGCA TACTCATTTG GATTAACAGA 

TTTATTkTGA GCTTTCTCAT TTTGCTTGGA ACAGCTATAT mACAATGCAA CTGATAATAA 

CAGAAAAAAT AGCAATAAAT ATTTTTTATG CATTAAACAT TC 

<2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 134 0 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 
ATAGAGTGAT AAAGGATGGT TGTCATATGA TAAATGCAGT AGTAATAGCA GTAATTTTAA 60 
45 TGATTGTGCT ATGTTTATGT CGATTAAACG TnTTATTAAG CTTATTTATC AGTGCGCTAG 120 

TTGGTGGCTT AATTTCAGGC ATGAGCATTG AAAAAGTTAT AAATGTATTT GGGAAAAATA 180 
TAGTCGATGG TGCTGAGGTA GCATTAAGCT ATGCTTTATT AGGTGGATTT GCAGCATTAA 24 0 

TTTCATACAG TGGTATCACA GACTATTTAG TAGGAAAAAT TATAAATX3CA ATTCACGCTG 300 
AAAATAGTCG ATGGTCAAGA GTTAAAGTCA AAGTGACAAT AATCATTGCA TTATTAGCTA 360 



1740 
1800 
1860 
1920 
1980 
2022 



30 



SO 
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TAGCAGGTAG TCTAGGATAC CTAGTCACTG AGATTTTAGA TCGTAAGTwA CACGCACAGT 984 0 

TTATCCCAGA ATTCaTTGGT TCaTTAGTTA tTGGGATTAT CGCCGTTATT GGACATACAC 9900 

TTATTCCAAC AGGTGACTTG GCAACTATTA TCATTGCGGC AGTCATGCCT ATTGTTCCTG 9960 

GTGTATTAAT AACAAACGCA ATACAAGATT TATTTGGTGG ACACATGTTG ATGTTCACAA 10020 

CGAAATCATT AGAAGCATTG GTTTnGCGTT TGGCATCGGT GCTGGCGTTG GTAGCGTATT 10080 

AATTTTAGTA TAGGAGTATC AGACTATGTT TTGGATCTTA AACTTTATCT TTAGCTTTTT 1014 0 

AGCTTC 10146 



(2) INFORMATION FOR SEQ ID NO: 244: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 
<D) TOPOLOGY: linear 

20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 



25 


ATTTAATTGG 


TTGGTGGCGT 


ATTCTCnTTT 


CCAAGATTTG 


CCACGATTGG 


ATGTTCAGTC 


60 




GACAAATATG 


TAACCAGTCC 


CGATAAATTA 


CGTTCCTCAA 


TCATTGATAG 


TATGTCTGGT 


120 




GTTTGTAACG 


ATTTATCGAG 


TAAGGCATCA 


AGCAATTGAT 


AATGTCCCAA 


TACAACAAAA 


180 


30 


TGCACGTTGT 


CTCTCAACTG 


CTGTTGAATA 


AACTGAATAA 


AGAGCTTTAA 


GCTCTGTTGC 


240 




ACATTCGCTA 


ATGATGGTGC 


ATAGTTTTCC 


AAACCAACTT 


GTACAGCCGC 


TTCATTATTC 


300 




CGAATGATTA 


AACCTGTGTA 


TGCCACTTTT 


GTTGCTGCAG 


TTGGATACAT 


TGAGTAATAA 


360 


35 


CGCAATAATT 


GATCTGTAAA 


ATCATTTCGA 


AGTGCATAAA 


TTTGATGCTC 


ATGTTGCCAA 


420 




AAATftCGCT 


CACCCATCTG 


CTgCAAATCC 


TCATGGTTCA 


ATTGTTTCCA 


GTCCAACTTT 


480 




TCAACCACAC 


TAAAATCAAC 


TAACTCATAA 


TCCGCTTTAT 


TAAAATATTT 


TAAAAATGCT 


540 


40 


GTTTCCGATT 


CTTTTAACGC 


AATTAATTGT 


TCTGnATTAT 


TCACTCGACC 


ACCCTTTACT 


600 




TTCAATACTG 


TATTTAAAAT 


CACTTGGTAT 


TTTCGTTGTT 


TGCTTTACTT 


CTCTACCACG 


660 
720 


45 


CTAAAGTGTA 


ATATGATTAA 


TAACTTATCA 


TTTTTAGCAA 


TACATTACAA 


CCTTTTTCAG 


AAAATTCGGT 


GTATTGATTT 


TAAAATTTTT 


TAAAATAnAA 


AAGGCAAGAC 


ATTTGTGCCT 


780 




ATAAAAATGC 


TTAACCAAGA 


TTTTTATATT 


GaAGTTGTAC 


TTCTTGCACA 


TATTGTCCTT 


840 


50 


GCCTTATTAT 


GTAAAGTTAT 


TTTCTTTCTA 


TCrriTTATT 


AAATTTAACT 


ATTCTTCATA 


900 




ATCCCGATTC 


CCTTTAAAGT 


AACGTCTATC 


TTGTTTACTA 


TATACATTTT 


CAGGATTAAA 


960 



55 
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TATGATAAAA TTTATAAAGA ACTGATGATT TTTGAAAAGG AGCGATAAAC ATGAGTCAAG 8 04 0 

CAGCCGAAAC ATTAGATGGT TGGTATAGTC TACATTTATT TTATGCAGTT GATTGGGCAT 8100 

CATTACGTAT AGTTCCAAAG GACGAACGCG ATGCACTTGT CACTGAATTT CAATCATTTT 8160 

TAGAAAATAC AGCAACTGTA AGATCATCAA AATCTGGTGA TCAAGCTATT TATAATATAA 8220 

CTGGTCAAAA AGCAGATTTG TTATTATGGT TCTTACGTCC TGAAATGAAG TCTTTAAATC B280 

ATATTGAAAA TGAATTTAAC AAATTGCGCA TTGCTGACTT CCTAATCCCT ACATATTCAT 834 0 

ATGTATCAGT CATTGAATTG AGCAATTATT TAGCTGGTAA ATCTGATGAA GATCCTTATG 84 00 

AGAACCCTCA TATCAAAGCA AGATTATACC CAGAATTACC ACATTCTGAT TATATTTGTT 84 6 0 

TCTATCCAAT GAACAAACGT CGTAATGAAA CTTATAACTG GTACATGTTA ACTATGGAAG 8520 

AACGCCAAAA ATTAATGTAT GACCATGGTA TGATTGGTAG AAAATATGCT GGCAAAATCA 8580 

AACAATTTAT TACTGGTTCT GTAGGGTTTG ATGATTTCGA ATGGGGCGTA ACATTGTTCT 864 0 

CAGATGACGT ATTACAATTC AAAAAAATTG TATACGAAAT GCGCTTTGAT GAAACAACAG 8 700 

CACGATACGG TGAATTCGGT AGTTTCTTTG TAGGACATAT TATTAACACA AACGAATTCG 8760 

2S ATCAATTCTT TGCGATTTCT TAATACATTG GTACGTTTAT AAATTAATAA AAAAATTCCA 882 0 

AGCTTATCGG TTTAAGCTTG GAATTTTTCG TTTATCTTCA GTATATTCCC GTATACATAA 88 8 0 

GACGTGATTT GGTAAATAGT TGAAATCTGT ATGTTTAAAC TTATATATAT GTGCTAATGT 8 94 0 

ATTATCAATA ACAAAGtACA CTTTGCTCAT AGCAAgTsaC CCGAgTAGTC TTCCTTGGGA 90 00 

GAACTTTAAC TACTATCACT ACATATAAAC GTTAACCTCA ATAGAAATTA TACAGTCGCT 90 6 0 

ACTCTATACA ATTTTTGTAA TGGTTAACTA ATATTATTTT AACCTATTTG AAATATTTGA 912 0 

AACATATTTT TGTCX^AATTT TTTTCAATAA TTTTTCCTTT TTATACTTCA AGAGAATTTT 918 0 

AACTACTAAA AATTCCGATG ATTATTATTA CAATAGTATC AAATATTAGT TTTTTAAAAT 924 0 

CAATAACAAC TTATCAAAAA GCTCATGTGG TTATTTTATA GTGTATAAAC TATAATGAGT 93 00 

ATTAAATTCT TATAAACAAT GGTGATGAAA TGGACATAAA TTCAGAAGAA TACAAACAAG 93 6 0 

AGGTACTTAT CAAAGACGTT GTCATGCTTG CTGCTCGCAT ACTATTAGAA TCTGGTGCAG 9420 

AAGGTACGCG TGTAGAAGAT ACCATGACAC GTATTGCAAA AAAACTTGGT TACAGTGAAA 94 8 0 

GTAACAGCTT TGTTACAAAC ACTGTCATCC AGTTTACGTT ACATTCGGAA TCGTTTCCTA 954 0 

GAATATTTAG AATTACCTCT CGAGATACAA ACTTAATAAA AATTTCTCAA GCTAATAAAA 96 0 0 

SO TTTCGCGTCA AATTACAAAC AATGAAATTT CTTTAGCCGA AGCAAAAACG CAACTTGAAA 966 0 

AAATATATGT TGCTAAGCGT GACAGCAGTC TTCCCTTTAA AGGTTTTGCT GCAGCAATGA 972 0 
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TTACATCTTG 


AACAGTAATC 


GTTTTGTTAA 


GCAATGTCTC 


TAATGAGGCC 


ATACAAGATG 


624 0 




GTTCAATTTC 


AGGATATTTA 


AATTTAGTCA 


CTTCACCTTT 


TAAAGCATGT 


TCATAAAATG 


6300 


5 


TTTGCATCAT 


CAATGCACGT 


TCTGAACCAG 


AGCCTTCAAC 


ACAAAGATAA 


ATTTGTACAG 


6360 




CAATACCGCC 


TCTAACTCTT 


CGTTGCGATA 


TGCCTGCAAA 


TTTCTTACCA 


TCGATACTTA 


6420 




AGTCAAATTT 


TCCTGGGCAA 


TAAGAATGTT 


CAATTTCCAT 


CGTATCAATA 


TCAACATTCT 


6480 


10 


CATTTTCGAA 


CATTTTGCTA 


ATTAAGAGGT 


ACATCACAGT 


AAACGCTTCA 


TCAATCGTTG 


6540 




TTTCTGTTTG 


TCCTTTGAAC 


ATCAGCGATA 


TATTTAATAC 


ACCTTGATCT 


AGAACGACAC 


6600 


IS 


CTAAGCCACC 


AGAATTTCTA 


ACAATGGCAT 


TATAACCAAT 


CTCATTCGTT 


AAATAATCAA 


6660 


TGCCATCTTT 


TAAAAACGGC 


AATCTTGAAT 


CATGAATACC 


AAGAATAACA 


GTATGTTGAT 


6720 




GAATCCAAGT 


ACGCACAACA 


TTATCTGATA 


TATCTTTGCC 


CACACTTTCG 


CAAAATGTAT 


6780 


20 


CATCGAATGC 


GAAAGATTGC 


ATAGGTTCTA 


ATCCAGAAGA 


ATGATCGATA 


TATCGCCAGT 


6840 


TGACGCCATT 


AAAATATTTA 


CTCGCTAAAT 


CCATCGTTAT 


TGTAAGGCTT 


GCGCTGCTGT 


6900 




AATAATTGAA 


AGATTGTATA 


CATCTTCAAT 


TGAGCAGCCA 


CGTGATAAGT 


CATTTACTGG 


6960 




AGAATTTAAA 


CCTTGTAATA 


CTGGACCAAC 


TGCATCATAT 


CCACCTAAAC 


GTTGTGCAAT 


7020 




TTTGTAACCA 


ATATTACCAG 


CTTCTAAACT 


TGGGAATACA 


AAGACATTTG 


CATCACCTTG 


7080 




TAATTTAGCA 


CCTGGCGCTT 


TTTTCTCAGC 


AACACCTGGT 


ACAATCGCAG 


CATCAAATTG 


7140 


30 


GAATTCGCCA 


TCAATGATTG 


CTTCTAATTT 


TtCTTCTTCA 


GCriTTtCTT 


GTGCTAATTT 
TTGAAAAGCT 


7200 
7260 




GACAGCTTCT 


TGAACi-rriG 


TCACGTCGTC 


TGATTTAGCA 


GACCC"I^r'rTG 




TAACATTGCA 


ACTTTTGGAT 


CCATGCCAAA 


GCTTAATGCT 


GATTTTGCAC 


TTTCTACTGC 


73 2 0 


35 


AATTTCTGCA 


AGTCCTTGTG 


AATCAAGTTC 


TGGATTGATT 


GCACAATCAC 


CAAAGATGTA 


7 J o 0 




TTGTTCATCA 


CCTTTAATCA 


TAAAGAAGAT 


ACCTGATGTT 


CTTGATACAC 


CTGGTTTCGT 


744 0 




TTTGATGATT 


TGTAAAGCTG 


GACGCACAGT 


GTCGCCTGTT 


GAATGTGCTG 


CACCACTAAC 


•7 e A ft 


40 


TAAACCATCT 


GCTTTACCAG 


CATAAACAAG 


CATTGTACCG 


AAGTAGTTCA 


CATTGTTTAA 


C £ ft 




TAATTCTTGT 


GCTTGTTCTT 


CAGTCGCTTT 


ACCTTTACGT 


CGTTCAACAA 


ATGATTGAAC 


^ O ft 

7620 


45 


TAATTCAGCT 


TTCAATTCAC 


TTGTCGCAGG 


ATTAATTAAT 


TCAATATTAG 


AAAT AT CAAG 


O ^ O A 

VboO 


ATCAAGTTTT 


TGCGCTAAAG 


ATTGAACCTT 


AGTCTCATCA 


CCTAACACGA 


TTGGTGTAAC 


7740 




ATAATCTGTT 


GCTTGTAATT 


GTGTTGCAGC 


TGTTAGAACA 


CGTTCGTCCT 


CTCCTTCAGG 


7800 


SO 


TAATACGATT 


TTAACGTTTT 


TACCAGAAAG 


TTTGTCTTTT 


AATACATTTA 


ATAAATCAGC 


7860 




CATAATGTCC 


TCCTGTAATA 


TAAATCTTAT 


TAATCATTCA 


CGGTATAATT 


ATACGCCATT 


7920 
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AAAAGTGACT TTCGTTTCAG TGTAAAATTT TTCTAATGTA ACAGATATGC TATTATTCAT 
TGGAATGATT AGTGCTTCAT CTTTTTTACC CCAATATTTT ATAAGTGCAA TATTCGTATG 

5 

TGCACGTGCT TTGCCACTTT TAATCAACGC ATTAACCTCC TAAATTCTCA ATCCAAGTAT 
GTGCTGCACC AGCTTTTTCT ACAGCTTTTA CAATATTTTT CGCTGTTGGT AAATCTTTGG 
10 CAAGCAATAA CATACTTCCA CCACGACCAG CGCCAGTAAG TTTTCCAGCA ATCGCACCAT 

TTTCTTTACC AATTTTCATT AATTGTTCTA TTTTATCATG ACTAACTGTC AACGCCTTTA 
AATCCGCATG ACATTCATTA AAAATATCCG CTAAGGCTTC AAAGTTATGA TGTTCAATCA 
CATCACTCGC ACGTAAAACT AACTTACCGA TATGTTTTAC ATGTGACATG TACTGAGGGT 
CCTCACAAAG TTTATGAACA TCTTCTACTG CTTGTCTTGT TGAACCTTTC ACACCAGTAT 
CTATAACAAC CATATAGCCG TCTAAACTTA ACGTTTTCAA CGTTTCAGCA TGACCTTTTT 
20 GGAACCAAAC TGGTTTGCCT GATACAATCG TTTGCGTATC AATACCACTT GGTTTACCAT 

GTGCAATTTG CTCTGCCCAA TTAGCCTTTT CAATGAGTTC TTCTTTCGTT AATGATTTCC 
CTAAAAAATC ATAACTTGCA CGAACAAAAG CAACCGCGAC AGCTGCACTC GATCCTAATC 
CACGTGATGG TGGTAAATTC GTTTGGATCG TTACTGCTAG CGGCTCTGTA ATATTATTTA 
ATTCTACAAA ACGGTTCACC AAAGACTTAA GATGGTCAGG CGCATCATAT AACATACCAT 52 8 0 

CGTAAACATC GCTTTTAATA GACGAATAGT TCCCGCTCTC TAAGGCTTCT ATTAAAACTT 534 0 

TGATTTTACC TGCGTTAAAC GGTACTGCAA TAGCAGGCTC TCCAAATGTA ACAGCATGTT 
CTCCTATTAA AATAATCTTA CCTGTCGATT CCCCATATCC TTTTCTTGTC ATGTCAATAT 
CACCTTTTAT ATTTATCCTA TACTTGATTC ATTATTTTTA TTTATTAGTA AAAGACATCA 552 0 

TATTCTAAGT TGCAtngCAT TCGCGTTAAA TTTCATTGCA GTCTTTATCT CACATTATTC 55 80 

ATATTATGTA TAATCTTTAT TTTGAATTTA TATTTGACTT AACTTGATTA GTATAAAACT 564 0 

AACTTTCGTT TACTTCAAAG TTTAAATCTT ATCGAGTGAT ATTTCAGATT CTTTATCTTT 5700 
TTATAAAATA GCCCTACAAT TTATAATTTT CCACCCTAAC TATAATACTA CAAATAATAA 576 0 

TTGGAATATA TAGATTTACT ACTAAAGTAT TAGAACATTT CAATAGAAGG TCGTTTCTTT 5820 
45 CATAGTCATA CGCATTATAT ATACCCTATT CTCAATCTAT TTAATACGTA AAACATGAAA 5880 

TTTTCT-TATT AAATTTATTA TTTCCATCAT ATCATTACTT TTAATTTAAT GATGITCAAT 594 0 

TTAAATATTA GGTCAATAAC ATATTTATGC TTTTTATGGA TACTTTCAAA AATAACAGCC 6000 
CCAAACGATA ACTTGAAAGG GGCTGTTAAA TATTTAACTA TTGCATTTGA TCTATCATTT 6060 
TCTTGTTTCT TTCAATCATT TTATCAAAAT ACGTATCGTA TCTTTGCCAT TCTTCTTGAG 6120 
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5160 
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10 



15 



20 



25 



30 



35 
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TGATGTTTTA 
AGTTTCTATA 
CACCATCTTT 
ACAACGATGT 
CACTTCGCTA 
CATATTTTCA 
TTCTTCAACC 
CGCTAGCCAT 
CTTCATATTT 
AAATTCATTT 
TTTATGACCA 
TATATCGCAA 
TTGTTTTGCT 
TGGGTTATGA 
AGTAGCAGTT 
AGCATATTCT 
TCCTTACTCA 
ATCAAACTGT 
TTTCACATTA 
GTGAACAAGC 
TCCTAGATTC 
AAAATCTTTG 
CCAATATTGA 
CTTTTTAGAA 
CX3ATTCAAGT 
TCCACCATAA 
CTTATCTGAC 
GCTTGCCGAT 
ATACCAATCG 



GATGCGCCGT 
TCAACTGTAG 
TGCACACCTT 
GAATCTTCTA 
ACAAAGTGTG 
GGTGCTTGTA 
GTAGTATCTT 
CCACTATATA 
GCAATCACTG 
AATACTTTTA 
TTTGAATCAT 
CTTTTCGCGT 
GcATGTGGAT 
TGTAATGCTT 
ACAAAACGAT 
CCAGCAATAT 
ATTATTTCAA 
GTTAATAATT 
GGTCCCGCAT 
GCCATGACAT 
GTGGCATGCA 
TCTTGAATCG 
TAAAACCTTG 
TGTTGATTAA 
GGAACGGCAT 
ATACTACGCG 
AGCTGCATGT 
GAAGCCAACC 
ATGCCAGCTC 



GATACTTTTC 
CTTCTTTATC 
TAATGTTATT 
AGAAGTCACC 
GTGATGACGC 
ATGGTTCGAT 
CAATTTGATG 
CACTCACAGC 
CTAGTTTATA 
TAACTGACAC 
CTAAATTACT 
ATTGTTCAAA 
CTGAAATGAC 
TTGAATGAAT 
CTAACGCAAT 
AAAGTTTTCC 
TTCCTGTGGC 
TATCTATAAT 
CCATTGTAAA 
CATAACTTTC 
TACGCAAACC 
CTGCTTTTGC 
ATGTGTTTCG 
TCACAACAAA 
ATGACGTCTC 
ACGCAGAACC 
CTAGCGCTTG 
CTGCTGCTGT 
TATTTCTGAC 



AGCAATATCA 
CATACGTTGA 
TGTTTTAAAA 
GTAAAATGAA 
CGGTGAGCCA 
GTGCAATCCA 
CTTAACCCAT 
AATATCTCCG 
AATGTATAAA 
AAGTACTGCT 
ATCAATAGTC 
TATTTCAATA 
AATACTATCT 
GGTACCTTTA 
AAGTACAGAT 
GGGTGCTTTG 
AATAATGTCA 
CTGTTGCTTG 
ATAACACGGA 
TTGCACAAGA 
ATTTTCTTCA 
TTCAGCTAAA 
TGTCAATGAC 
TATCATGGCA 
ATCACTATAC 
CGAACCAATT 
ATTACAAGCT 
TGGTACAAAA 
AATATCCATA 



CACAAATATT 
ATAATTGTAC 
GCATGAATAA 
GGATCTGATT 
GTCCAACCGA 
GGCCAGTTTT 
TCATGATCAA 
CATGAACTTA 
TTAGATAACT 
GCACTTGAAC 
AGATGAAAAT 
GCTGTGaCCA 
TCATCTCTAC 
TATTGGTCTG 
TTATATCCTG 
ACCTGAATCA 
CTATCAATAA 
TTTTTCTTTT 
TACCCCGCTT 
TATGTGAACG 
ATTACTTCAC 
TCTTCATCAA 
ATACCATATC 
AGGTCATCTT 
CCTTTTTCCC 
CGCGCCAATC 
GCTGCTAAAG 
TTGTCGCTTT 
TATTTTGAAA 



TTAGCTTTTC 
GATTCTGACX3 
GTTTTTCAAC 
TCAAACGTTT 
TAAGTACTTC 
TGATTAAAAC 
AAGTACTATA 
AACTTTGTAA 
TCATATCATA 
CTAATCCATA 
GCTTCATCGC 
CATAATTTAA 
TAAATGTAAC 
CTTCTTCAAT 
GTTCTGTTAC 
TTTTATCTCT 
TTTGGTTATT 
CTACAAGTAT 
CTCGGCATTC 
GOGGTGTTGA 
CAAGGCGTTT 
TATGATCTAA 
GACTAGGTAC 
CAAAATGATT 
ATTCTGCAAA 
TCGATAAATC 
CTGCATATGC 
CAATTTCTGC 
TTTTCTCTAA 
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2760 
2820 
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TGTTGTCAAT GCGCTAAACA ATPAAPATTA CCACTTATTA GCAGACGATA ACAACATTGA 84 0 

TGTACTGGAT TTTAAAGCGC AGTTTAAATC TAATACTGAA GTGAATTTAT TAGATCAACA 90 0 

TGACGATATC GTCGATAGTA TTACTGCACC TCATATCATT ATTAATACCG GTGCTACCTC 
TGTCATTCCT AACATTAAAG GCCTTGATCA AGCTAAACAC GTCTTCX3ATT CGACAGGTTT 
ATTAAACATT AGCTATCAAC CTAAGCACCT CGTTATTGTA GGTGGCGGTT ATATCGCCTT 
AGAATTTGCT TCAATGTTTG CGAATTTAGG TAGTAAGGTA ACAGTATTAG AACGTGGCGA 



960 
1020 
1080 



1140 



ATCATTTATG CCACGCGAAG ATCAAGATGT CGTTGCATAT GGTATTACTG ACTTAGAAAA 1200 

TAAAGGCATT GCATTGCATA CAAATGTTGA AACGACTGAA TTGTCATCTG ACAATCATCA 1260 

TACAACAGTC CATACCAACG TTGGTAACTT TGAGGCTGAT GCAGTACTTT TGGCTATCGG 1320 

GCGCAAACCG AATACGCATT TAGCTTTAGA AAATACTGAT ATCGAATTAG GCGACAGAGG 1380 

20 CGAAATTAAA GTCAATGCTC ATCTTCAAAC AACTGTGCCG CATATTTATG CTGCAGGTGA 144 0 

TGTTAAAGGC GGACTTCAAT TTACGTATAT ATCTTTAGAT GATTATCGAA TTATCAAATC 1500 

AGCGTTATAT GGTAATCAGT CACGTACGAC TGACAATAGA GGCAGCGTGC CTTATACAGT 1560 

25 ATTTATAGAT CCACCATTAT CACGTGTTGG ATTAACTAGT AAAGAAGCTG CCGCTCAACA 1620 

TTATGATTAC ACTGAACATC AACTTTTAGT AAGTGCTATA CCTCGTCATA AAATTAACAA 16 6 0 

TGATCCAAGA GGTTTATTTA AAGTAGTCAT TAATAATGAA AATAATATGA TTTTAGGTGC 174 0 

TACATTATAT GGTAAGCAAT CTGAAGAATT AATTAATATA ATTAAACTTG CGATTGATCA 1800 

AAACATTCCA TATACCGTAT TACGAGATAA TATTTATACG CATCCTACGA TGGCCGAATC 186 0 " 

ATTTAATGAT TTATTTAATT TCTAGACAAA ACATAAAAAC CTGGTGGCAC GCATTGAATG 1920 ' 

ATGCTGCCAT CAGGCTTTAT TGTTGTGCTT TTCGCTTTTC TAATTTTTCT TTAAGCTTTC 1980 

TATCTTGTTC TTCTTTACGA CGTTTACGTT CTTCATGTCG TTTTCTTAAA CGCTCTTCTT 204 0 

CTTCAGGATC ACGTGGTTTC TTTAATTGTT GAGAAACTTT TTCGATTAAT TCTTCTTCAG 2100 

TAAGCGCagc CAGTGGGCGG TTATTAACAA AAGTGAATGT TTTTCGGCGT CCAGGTCCAC 2160 

AATAAGATTG ACAACCTATC ACGATTTCAG CATCGGGATC TAATTTTTCC AACTTCTTTT 2220 

GTAACGTTCT TATATTGACT GCCTGACATT CATCACAAAT AAGGAATGTA TTTTTCATAT 2280 

TGCTACCCAC CTTTCTTTAT CATATCTATA TCGTCGATTT CATTAATTTT TTCGTTAACT 234 0 

CTATCTATTT TACTCTTTTT AATATTTTTT TCAAGATACG TAACACGGCT GaCAATAAAA 24 00 

50 AATGGAGCAT TTATCTTCTA ATTAAATTAG ATGaTTGCTC CCCTATCAAA TCATTTATTG 24 60 

CCCATGATAA ATATTAAATT TTAATGGTTT AATACCATGT TTTGTCCATT CATCATAAAT 2520 
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GAATGTTTTC AGTTTCTTTA CTCATGATGC CGATAAATAC TACAGGAATT AATTCTTTGA 114 0 

GAAATGAAGA AATCTCACAT GGCACGGCTA TTATGAACTT TGGTCGTGTA ATGGCTGGTT 1200 

CACTAGGCAC AGCTTTAATG GXTACATTAA TGAGTTTTGG TGCAAAAATA TTTTTATCTA 1260 

CATCGCCATC GCATTTAACT GCAACTGAAA TTAAACAGCA ATCCATTGCT ATAGGGGTGG 1320 

ATATCTCATT TGCTTTTGTA GCTGTGCTTG TTATGGCAGC TTATGTGATA GCACTTTTTA 1380 

TAAGAGAACC TAAAGAAATA GAAAGTAATA GAAGGAAATT TTAAAATAAT TATAGTAGTT 144 0 

GGTCTATTTA AAATAATAGG CTAACTGCTT TTTTTATTTA ATAAAAAGTT TTATACTTTT 1500 

AGTGATAGAC TAAGCAAAAA TTGTTATTTG CTATGATGTA GATGTCTTAA AATGATTAAG 1560 

GGGGATTTGC TTTGTTAACG GTAGATCAAG TGAAAGAATT GGTAGGAGAA ATTAAAGATC 1620 

CTATTATAGA TGTGCCTTTA AAAGAAACAG AAGGTATTGT TGAnGTTTCT ATTAAGGGAG 1680 

AAnAAGAACA TGTGAGTGTT 1700 
(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 
TGCATCAACT TTCAAACAAT AAATCATCAC AATAACCACA CCTAATTCAA 
ACATAAGTAT TGACACATTG AGCAAAATGA TTTTTAATTG TAACTAATAC 
ATGAGGTGAG AAACATTGAA TTTAGAATTT AACATTGCCG TGCATGTATT 
ACTAAGCATC aTTCAGAAAA ATTCAATAGT AGTTCATTAG CAGAaTTAaC 
CCTGTTCAAT TACGACGCGT GACGACTCAA CTTGTCGATT TAAAAATGAT 
CGAGGTAAAG ATGGCGGTTA TTTAGCAAAT GATCAAAGTG CTGATGTCTC 
TTATaTAAAC ATTTTGTCTT AGAGAAAGAA CACCACACAC GTCTATTTAC 
GGCAGTCACT GTCAAATTGC TCGTAATATT GCAACTACCA TGTCACATTA 
GAACAGAATA TCATTATTAA TTTTTATAAT GGAAAAACAA TCAAAGATGT 
ATTCAAAAGG AGGATTTATG TCATGAAAAC ATATGATTTA ATTGTAATAG 
AGCTGGTAAA ACTTTAGCGA AATACGCTGC ATCAACAGGT CAACACGTCG 
ACAATCTCCG AAAATGTATG GAGGCACTTG TATAAACATA GGATGTATAC 



CACTTTTCTU^ 
AGTTACAATT 
AGCTTTTTTA 
TTGTTTAAAT 
TGACACAATA 
TCTAGCAACA 
TGGCGACGAA 
TCAGCAAGAC 
CATTGAAGAC 
GATTTGGGAA 
CAGTTATCGA 
CTTCGAAGAC 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
€60 
720 
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AACGATCATT GCTCTTATCG CTCTAGGTTT AAGTCCTTTC GCTTTTTCGC GTTCAAATTT 474 0 

TTCAGCAGTC GCCTTAATTG CTGCAGCTTC CACTTTCTTT AAATTTTTAC CGTGCTTAGC 4 800 

5 

ATCCACTGAT ACAGGATAgT AtCCTTTATC AATAAAAAAT tGTTCCCATT TTGACATCTC 4 860 

ATTTAAATTA GACATATCTT TTTTATTTAA TATAACAACA CGTGGTTTTT GGTTAATAAC 4 920 

TTCATCTATC ATAGGGTTTC TTGAACTATA TGGAATTCTT GCATCTACTA GTTCAAACAC 4 980 

10 

TACATCTACT TTTTTTAATT 5000 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1700 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

AACCCGnAGA CGAAACTnCA TAGTTGCCTG TTATGTTGAT TAGTGCGTTT GTAGCTATTT 60 

25 nAAATCAAAC ATTATTAAAT ACAGCGTTaC CTAGTATAAT GAGAGAATTA AATATCAATG 120 

AAAGTACATC GCAATGGCTA GTTACTGGGT TTATGCTTGT TAATGGCGTC ATGATACCTC 180 

TGACGGCATA TCTAATGGAT AGAATTAAAA CTAGACCTTT ATACTTAGCG GCGATGGGGA 24 0 

30 

CATTTTTATT AGGTTCTATT GTTGCAGCCT TAGCTCCGAA TTTTGGAGTT TTAATGTTAG 3 00 

CTCGTGTAAT TCAAGCGATG GGTGCAGGCG TACTTATGCC CTTAATGCAA TTTACGTTAT 3 60 

TTACATTGTT CAGTAAAGAA CATCGAGGTy TtGCAATGGG ACTAGCAGGT TTAGTAATTC 4 20 

35 

AATTTGCACC AGCAATAGGA CCTACAGTTA CAGGATTAAT TATTGATCAA GCGAGTTGGC 4 80 

GAGTTCCATT TATTATAATT GTAGGAATTG CTATACTTGC CTTTGTTTTC GGTTTGGTTT 54 0 

CAATCTCGAG TTACAATGAA GTGAAATATA CGAAATTAGA TAAGCGTTCA GTAATGTATT 600 

40 

CAACTATTGG GTTCGGGTTA ATGCTATACG CATTTAGTAG TGCAGGAGAT TTAGGATTTA 660 

CAAGTCCAAT AGTAATAGGT GCGTTGATAT TAAGTATGGT TATTATCTAT TTATTTATAC 720 

45 GTAGACAATT TAATATTACT AATGCACTTT TAAATTTAAG GGTTTTTAAA AATAGAACAT 780 

TTGCATTATG TACGATTAGT TCAATGATTA TAATGATGTC AATGGTTGGA CCTGCGCTGC 84 0 

TTATACCGCT ATATGTTCAA AACAGTTTAT CTTTATCTGC CTTGTTATCA GGACTTGTTA 900 

50 TCATGCCTGG TGCAATAATA AATGGTATTA TGTCAGTTTT TACAGGTAAA TTTTATGATA 960 

AGTATGGTCC TAGACCATTG ATTTATACTG GTTTTACAAT TTTAACAATT ACTACAATTA 1020 
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AGATCTTTTC AGGAGTCTTC GCAGCAACTT CTTCAATCTC AGTGCCCCCT TCTTCAGACG 294 0 

CCATCAATGT TACTTGGTCA GTCGCACGAT CAATAACGAA TCCAACGTAA TATTCTTTTT 3000 

GAATAGCACA ACCTTCTTCG ATATATAAAC GCTTAATTTC TTTACCTTCT GGACCAGTTT 3060 

GATGTGTCAC CAAAGTTTTC CCTAATAATT CTTTTGCATA TGTTTCTACC TCAGATAAAG 3120 

ATTTAGCAAT TTTTACTCCG CCTGCTTTAC CTCTACCTCC AGCATGAATT TGTGCTTTTA 3180 

CAACATAAAC ATCAGAATTT AA TT CTTTTG CTTTCTCCAC CGCTTCTTCA GCAGTAAATG 324 0 

CTACTCGTCC TTCTGGAACT GCAACGCCCA TTGAACGAAA TATTTCTTTA CCTTGATACT 3300 

CGTGGATATT CATCTTCCAT CCTCCTGTTA CTTAGGTTAA GTTCCCTTAC AATTATAAAA 3360 

AATGTAAGCG CTATTGTAAA CTTAAATGCT ACTTTTTTAT CATTTAATTG AATTTTACGA 3420 

TTTACAGTAA CGATTTTATA GGTTCAAAGC TTTTTCTATG CTCTTTCATA ATGCCAATAT 34 80 

CATCGATTGC TAGTAAATGT TGTTTGGTAC CGTAACCCGC GTTTTTTTCA AAACCATATT 3 54 0 

CAGGATAATC TTTAGATAAC TGTGTCATAT AATCATCACG AAAAACCTTT GCCATGATAC 3600 

TTGCAGCTGC AATGGACACA CTTCTTGCAT CACCCTTGAT TAAAGATACT TGAGGCAGTG 3660 

CATTATCAAG CGTCATCGCG TCTATCAATA AATGCGTTGG TTGTACTGAT AATCCATCAA 3720 

TAGCTCGCTG CATGGCGATT TGAGTAGCTT TATAAATATT AAATTCATCT ATTTCTTnCA 3780 

GtGTCGCGAT CCCATATGCA AAAGCAGTAA CTTCATTTTT TAGTGCTTCA TTTAATTCTA 384 0 

30 GACGTTTCGT AACAGGTACT TTTTTCGAGT CATCAAGGCC CAAATAATTG TGATTTGAAT 3900 

TTAAAATTGT TGCGCATGCA ACGACTGGAC CTGCTAAAGG TCCTCTTCCA ACTTCATCAA 3960 

TCCCACAAAT AATAGCATTA GGATGCTCTT TTAATATTTC ATTTTCAAAG TAAGTCATTT 4020 

CAACATACTT TTCTTTTAAA GCTTGTTCTT TTTCTAACXjC TTTTCTGCGC CTAGCTATGG 4080 

CATTTTGAAC ACCTTTTCGC TCATCTAAAA AGCATTCATG ATTTTCTAAT TCTTCTATTG 4140 

TATTAACCGC ATTAATCAAC TGCGTAACTT CTTTAATTGT TAGCGTCATT TGCTAATTCC 4200 

TCAGTCATAT CTTTAAAAAT ATCAAAACAA TAATTTCCrTA TTTTAGCATT TCGAATATCA 4260 

TAAATAATCA GTTCAATGAC TGCTTCGTAA TCAATTTCAT TACCACGTCG AATTAAGCCC 4320 

ACGTTTTTTC CCTATCGCAT CAAACCACGC TATGATTTCT GCAtCTTCAG GAACTTCAAT 4380 

ATTATAATGT GACTTTAATC GCGCTAAATC ATTTTGAATT AAAAAGTTTA ATCCATAGAT 444 0 

GGCAACTTCA TCTAAGTGCA CAATACTATC TTTTATCGCA CCAGTTAAAC TCAACTTCTT 4500 

ACCGACTTCT TCATCTTCAA ATTTAGGCCA AAGTATCCCT GGTGTGTCTA ATAGTTGTAA 4 560 

TGCATTACCA ACTTTAATCC ATTGTTGTTG TTTGGTCACA CCTGGTTTAT TACCAGTCTG 4620 
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GTAGGAGGTA AAATAATTAA CTTGTCTTTC CAAAATATGA AAAGTGTACT AAAaTTCATC 114, 

GCACGACAAA TAGCCCATTT CCGATACTTT TATAAAGTAT GGAaTGGGCT ATAGCCATTT 12 Ol 

ATATCATCTT TTAACTTTAT TTATTAACAG TTAATAATGA TTCATAAATA CCTGCrTCTT 12 6( 

TAGCAGCTTC AATTAATGTT GAACCAATTT CTGAAGGTGT TGCCGCTGTT TTCACACCAC 13 2 ( 

AACTATTTAA TGTTTTAATT TTCTCTTCAG CAGTACCTTT ACCACCTGAA ATGATTGCAC 13 8( 

CAGCATGTCC CATACGTTTT CCAGGAGGTG CTGTTTGTCC ACCGATAAAG CCTACAACTG 144c 

GmTGTCAT ATTCGCTTTA ATCCATTCAG CTGCTTCTTC TTCAGCCGTA CCACCGATTT 150C 

CACCAATCAT AACAACTGCT TTCGTTTCGT CATCTTCATT GAATGCTTTT AAAACATCAA 156C 

TAAAGTTTGT TCCGTTGACT GGGTCTCCAC CAATACCAAC AGCTGTAGTT TGACCAATAC 162 C 

CTTCTTCAGT CAATTGGTGC ACTGCTTCAT ATGTTAATGT ACCTGAACGA GATACTACAC 16 80 

CAACATGACC TmTTGTGA ATATAGCCAG GCATAATACC AATTTTACAT TCATCTGCTG 174 0 

TAATCACACC TGGACAGTTC GGACCAACTA AACGTGTTTT TCTACCTTGT AAGTAGCGTT 18 00 

TAACTTTAAC CATGTCTAAT ACAGGAATAT GTTCAGTGAT ACAAATAACC ATATCTAAGT I860 

25 CTGCATCAGC TGCTTCTAAA ATTGAGTCTG CAGCAAATGG TGCTGGAACG TAAATGACTG 1920 

AAACCGTTGC CCCAGTTTCA TTTTTAGCTT CTTCAACAGT GTTQPAM^QJ^ GGAACGCCTT 1980 

CI^CIiPXnrTQ ACCACCTTTA CCAGGCGTCA CACCTGCTAC TATTTTCGTA CCATPATCAA 2040 

GCATTTGTTT TGTATGGAAA AGGGCAGTAG ACCCTGTAAT ACCTTGTACC ATTACTTTAG 2100 

TATTCTTATC TATAAATACA CTCATCTTAG TGCTCCCATC CTTTCCTTAT GCTTCTTTGA 2160 

CTAGTTTAAC AATTTTTTGT GCACCTTCAG CCATTGTTGC TGCTGGTTCA ATTGCTAATC 2220 

CTGAGTCTTT TAAGATTTTT TTACCTAACT CAACATTTGT ACCTTCTAGG CGTACAACTA 2280 

GTGGTAAAGT TAAATCTACT TCTTTTACAG CTTCAACGAT ACCTTCTGCG ATAACATCAC 234 0 

ATTTCATAAT GCCACCGAAA ATGTTTACAA AAATACCTTT AACATTTTCA TCACCTAAAA 24 00 

TGATTTTAAA TGCTTCAGTT ACTTTTTCTC TAGTAGCGCT TCCGCCTGCA TCTAAGAAAT 24 60 

TGGCTGGGTT TCCACCGAAA TGATTAATCG TATCCATTGT TGCCATGGCT AAACCTGCAC 2520 

CATTAACCAT ACATCCGATG TCACCATCTA ATGCAATGTA TGATAAATCA TGTTTAGACG 2580 

CTTCAATCTC TTTCGGATCT TCTTCTTCTA AATCACGTAA TTCTACAACA TCTTTATGTC 264 0 

TGAATAATGC ATTATCATCA AAATTAATTT TAGCATCTAA TGCCT^TACA TCACCATCAG 2700 
so CTGTTGTAAC TAATGGGTTG ATTTCTACGA TTGAACAATC TTTTTCAATG AATACATTAT • 2 760 
AAAGTGCTAA TAAGAATTTA GCAGCTTTGT TAACAGATTC TTTAGGAATA TTAATATTAA 
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TTAATCGGCG TACCATTTGC AGCAGGTGTC GTTAACTTTG TCGTGCTAAC TGCCGCGGCC 1200 

TCTGCTACAA ATAGTGGTAT CTATTCGAAT AGTCGTATCT TATTCGGACT GTCACAACAA 12 60 

GGGTTAGGTC CTAAAGTTTT AAATAAAACG AATAGTCATG GCGTGCCTTA TTTATCAATG 1320 

TTAGTTTCAT CAATTGCATT ACTTATAGCA GCCTTGTTAA ACTACATTTT CCCTAATGCA 13 80 

ATTCAACTAT TCATATACGT TACAACGTTA tCAACTGTGT TGTTTTTAGT TGTtTGGGCA 1440 

ATGATnATTG TCGCTTATCn AATGTATTTG GAAAAAGCAT CCTGAGGCA 14 89 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 
IS (A) LENGTH: 5000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

TTTTCCATCA TrATcACCAT TTAAGACAAT AGCTGTATGA CCATATCCTC CACCA/^TCT 60 

^5 TCCACTAAAA ACCACTAAGT CCCCAGGTTC AGCTTTGAAA GTTGGTGTGT TGTGATAAAT 12 0 

TTTAGCTTCA CTATTAAAAT TATTTGCATA TGGTATATCT TTAGCTCCAT ATCCTTTtAA 180 

TCCAkGACCA TAAAGAkGAT TCCAATATAC ATTAACTAAA TCGAAACATT GCCATCCATA 24 0 

30 ACTACCGTCG AAATCCCATC CTCTATTCTC TAATGTTTTC AAATAATTTA ATGTACTGCT 3 00 

ATTACTATTA CTTTTATTAT TTGAAGACAC TGTTTTTGGT TTTGGTTCTA CTAATGGTGT 360 

CATTGGCACT TTTAATTTTT GACCAATAAA TATTAAATTA GGATTTGCTA TATTATTTGT 4 20 

ATTTTGAATA TTTGAAACTG TAGTTTTGTA TTTTAATGCT ATAGCACTAA GTGTGTCTCC 4 80 

TTTTCTTACA GTATAGATTT GTGTTTTTGG AGCTTCTCTA AAACTGTAGT AACCAAAGTA 54 0 

ATTATTAGTA ACTTTATTCG TTTTCTTCTG ATTAGAATTT TGAGCTTCCA AGTTTGCAAT 600 

TTTAATTTCT TTAGTAAGTT CATTGTTATT AATAACTAGA TTGTTACCTT GGCTTGAGTT 660 

TTTCGGAGTA TTTGAAATCT TTATATCTTG ATTAATTTCA TTTCCGTTTG AAATTGCTGA 720 

TTTGTTGTCT AACTTTAAAC TTGTGTCCGA TGTTTTAACA GCACCTTCAT TTTTTATTTT 780 

GTCTTTTGTC GTATTTTTAT TAGCATTTAA CTCTGATTTC GCGAATACAT TTTGCTCATA 840 

CCCTCTTGTA AAATCTTTAG ATTTATCAAT TTCATCTGCA TATGCTTTGT TCGACATACC 9 00 

CAATGCCAAA AACATACCTA TTGAAATTGA CAAAATTCCA ATACTAACTT TTCTAATTGA 960 

ATAGCGTACT TTACTTTGTT GTTTATTCAT GATGAAACAC TCCTTACAAT AATATACAAA 1020 
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ACATTAGGTG CCATGGGCTT ACCGGCACAA GGTTTAGCAT TAATTATTGG TGTTGACCGT 114 0 

ATCTTAGATA TGGTACGTAC ATGTGTAAAC GTTATTGGTA ATGCATTATC AACAATCGTT 1200 

ATAGCTAAAT GGGAAAACGT ATATGACAAA GCAAAAGGTC AAGAATATTT AAAATCAATT 1260 

TAAAAAATAC TATCTGACAT TTAArGnCCC TTACAACCTT TGGTTgTnAG GGCTnTTTTA 1320 
TGTCATGCGT CTTAAAGCCA GGCCGTATAn CGGTAAGCGT A 
(2) INFORMATION FOR SEQ ID NO: 24 0: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



1361 



60 
120 
180 
240 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 
TCAATATGTA AATCGGCGAT TGTCCCTATC CTCATACCCT TACCTTCTTT ACTCAAATTA 
AAATGTTATT TATTATGCCA TaGCTTATCT AATATATATA GTTAACTGCT TCATTTAGAT 
2s GATTATTTTA TATTTTTGCA TAAAAACTTA TATCTTTTCA AAAAAATCGA TAAGTTTTAG 
TTATCATACC CTTACCTATC AAATGTTTTT TCTTATATTT AAAAAAATAA TTGCTTTATT 

AAATGGATTT CTTTAGTATT TATAATTAAG AAAACGCTTA CACACAACTT TTTTATTTGC 3 00 

30 TTTATCCTGA GGAGGAAAAT TATGGCAAGA AAATTGCATA GAGAGTTGAA TAACAGACAC 360 

ATCCAATTAA TAGCAATTGG GGGCGCAATT GGAACTGGGT TATTCCTAGG ATCAGGTCAA 42 0 

ACAATATCTT TAACTGGTCC ATCACTGTTA TTCACATACA TGATTATTGG GGTTGTACTA 4 80 

TTCGCTTTTA TGCGCGCATT AGGCGAATTG TTGTTGAGCA ATACAAGATT TAATTCATTT 54 0 

GTTGATATTG CAAATGAATA TTTAGGCCCT TTTGGTGGCT TTGTCATTGG CTGGACTTAC 600 

TGGTTATGTT GGATTGTATC AAGTATGTCA GACCTAACTG CGATGGGACA ATACTTTGCA 66 0 

TTTTGGTATC CACAAGTCCC AAATTGGATT ACCGTGCTAT TTATTGTTTT AATCTTGATT 72 0 

AGCTTCAACT TATTAGGTGC CAGATTATTT GGTGAACTGG AGTTTTGGTT CTCGATTATT 7 80 

AAAGTTGTCA CAATTATTGC GATGGTTATC GTTGGTCTTG TATTAATCTT TTTCTCATTT 84 0 

AAAACACATT ATGGACATGC ATCATTCACA AACTTAATCA GTCACGGTGG CATGTTCCCT 900 

GGTGGAaCAT TTGGTTTCTT AATGTCATTC CAAATTGCTG TATATTCATT CATTGGTATT 960 

5^ GAACTTATAG GTGTAaCTGC TGGTGAAaCG AAAGATCCTG AAAAAACCTT ACCGAAAGCA 1020 

ATTAATAATG TACCTATCCG TATTTTATTA TTCTATATCG GTGGTCTATT AGTAATTATG 1080 
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ATATAATAAC CATGACAAAA TTAGAGATTT TATAATCATT GAAGCATATA TGtnTCGTTT 34 80 

TAAGAAAAAA GTCAAGCCTG AAGTCGATAT GACTATAAAA GAATTTATAT TACTGACTTA 3540 

TTTATTTCAT CAGCAAGAAA ACACACTTCC ATTTAAGAAG ATTGTTTCAG ATTTATGTTA 3600 

TAAACAATCG GATTTAGTAC AGCATATAAA AGTACTTGTG AAACATTCAT ATATTAGTAA 3660 

AGTTCGAAGT AAAATTGATG AGCGTAATAC TTACATTTCA ATATCTGAAG AACAACnAGA 3720 

nAAAATTGCA GAACnTGTTA CATTGTTTGA TCAAATCATT AAACAATTTA ACCTT 3775 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 
IS (A) LENGTH: 1361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 9: 

AGAAAAATTA GCCTACCTAT GCAAGTTGTT ATTGCATTAG TGTTAGGTGT TGTCGTAGGA 60 

25 CTTTTATTAT ATGGACAAGA AAACGTAGCA AATTACATTA AACCATTTGG TGACGTATTT 120 

TTAAATTTAA TTAAAATGAT CGTTATACCA GTTGTATTTT GCTCACTAGC GCTTTCTATT 180 

TCGAACGTTG GGGAATCGAA AACTGTAGGG CGTTATGGCT GGaAAACAAT TTTATACTTT 24 0 

30 GAAATTATTA CAACAATCGC AATAGGTTTA GGGATTATCT TCGGTAACCT ATTTAAACCA 300 

GGTGCTGGAT TAGACCCAAC AAAATTACCT AAAGGTGATA TTTCTAAATA TCAATCAACT 360 

GCACATGCAG CAGAACAATC TACATATGGa AATCATTTTA TTGATACCAT TGTACATATT 420 

ATTCCGACAA ACTTTTTTGA AGCTTTAAAT AAGGGTGAAT TATTACCTAT TATCTTCTTC 480 

GCAGTATTCT TTGGATTAGG ATTAGCTGCT GTAGGTAAAA AAGCAGAACC AGTTAAAGAA 540 

TTTTTAAGCG GATCGCTTGA AGCTGTGTTC TGGATGATTA ATAAAATTTT AAAATTAGCA 600 

CCACTTGGAG TGTTTGCATT CATTTGTACT ACAATTATTA CATTTGGTGC ATCCGCATTA 660 

TTACCACTAT TAAAATTAGT ATTAGTTGTT GTCTTTGCAA TGGTGTTCTT TGTATTCGCT 720 

ATACTAGGAC TAGTTGCATG GATGTGTGGT ATTAATATCA TGAATATTAT TAGAATCTTG 780 

AAAAGTGAAT TGCTTTTAGC ATTTTCTACA TCAAGTTCGG AAGCTGTACT TCCTGTAATG 84 0 

ATGAAGAAAA TGGAAAACTT CGGTTCTCCA AAAGAAATTA CTTCTTTTGT TATACCAATT 900 

GGTTATACGT TTAACTTAGA TGGATCAGCA CTTTATCAAT CTATTGCAGC ATTATTCGTT 960 

GCACAGATGT ATGGAATGCA CTTAACATTA TCAGAGCAAA TTGTGTTGAT GTTAACATTA 1020 
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AATATTTGGG ATATCCCATG GATTATTCCG CTTGTATTGA TACTTATTTT AATTGCATTT " igg 

AGCATGGCTG CACACATCAA CATCTTGATG ACAAGTGACG ACATTGCAAC CGGCCTCGGT 174 

CAAAACATAA AATTAATCAA ATGGATGATT ATTATGCTCA TCAGTATGTT AGCCGGTATT I80 

TCGGTAGCCG TAGCTGGATC AATCGTCTTT GTGGGTCTTA TCGTACCGAA TATTAGCAAA 18S^ 

CGATTATTAC CACCAAACTA TAAGTATTTA ATTCCTTTTA CTGCATTAGC TGGAGCAATC 192, 

CTAATGATCA TTTCAGACAT TGTTGCTCGT ATAATAATTA AGCCACTAGA GTTGCCTATC 198( 

GGTCTCGITA CCGCTGTCAT TGGCGCTATT GTCTTAATCT ATATTATCAA GAAAGGACGT 204 ( 

CAACGCTTAT GACCGAAAAG ATTAATAAAA AAGACAATTA CCATCTCATC TTCGCGTTAA 210( 

TCTTTTTAGC CATCGTTTCA GTGGTAAGTA TGATGATTGG TTCAAGCTTT ATACCATTAC 21 6 C 

AACGCGTACT GATGTACTTT ATAAATCCAA ATCACAGTAT GGATCAATTC ACTTTAGAAG 222C 

^„ TATTACGCTT ACCTCGCATT ACACTTGCGA TTTTAGCAGG TGCCGCACTA GGAATCAGTG 2280 

GTTTAATGTT GCAAAATGTA TTAAAAAATC CAATTGCCTC ACCTGATATT ATCGGTATCA 2340 

CAGGTGGTGC TAGCTTAAGT GCTGTTGTCT TTATTGCATT TTTCAGCCAT TTAACAATAC 24 00 

25 ATTTACTTCC ACTATTTGCA GTATTAGGTG GCGCAGTTGC AATGATGATA CTATTAGTGT 2460 

TTCAAACGAA AGGACAAATA CGCCCGACAA CACTCATAAT CATCGGTATT TCGATGCAAA 2520 

CGTTGTTTAT TGCGCTTGTC CAAGGATTAC TCATTACAAC GAAGCAATTA TCTGCTGCCA 2580 

AAGCTTATAC ATGGCTAGTC GGAAGTCTTT ACGGTGCTAC GTTTAAAGAT ACAATCATTT 264 0 

TGGGTATGGT TATTTTAGCT GTTGTGCCGT TGTTATTTCT TGTTATACCA AAAATGAAAA 2700 

TATCTATACT TGATGACCCT GTAGCGATTG GCTTAGGCTT ACATGTACAA CGTATGAAAC 2760 

TAATCCAATT AATCACTTCT ACTATACTCG TATCTATGGC AATCAGTTTA GTAGGTAACA 2820 

TTGGSTTTGT CGGTTTAATC GCACCACATA TCGCGAAAAC AATCGTTCGC GGAAGTTATG 2880 

CTAARAAGTT ACTAATCTCA GCAATGATTG GTGCCATATC AATTGTTATT GCAGACITAA 2S40 

TTGGGCGTAC CTTATTCTTG CCTAAAGAAG TGCCAGCAGG TGTATTTATT GCTGCTTTTG 3000 

GTGCCCCATT CTTCATATAC TTATTATTAA CCGTGAAAAA GTTATAACGA TATTATTAAA 3060 

^ ACAAAATGAC CTCACAACGA AGTTAGCTAA ATGATTCAGT TAACTAACCG TTGCGAGGXT 3120 
TTTTTATACA TATAGTTGTT GTTATTGTTA ACAAGCGTCG ACTTTCTTAA TTACATATTA 
ATACTTTATA TACAAATAAC ACCGACTCAT ATTCTATAAT ATCAATCAAT ATTCTTCGAT 
SO TTTTCAAATA TCGATAACTA TTTCTTATTT AAATATAGTG TTTGATAATG TCATTTATTC 
AAAAACACAA ATTTTAATAA AAATATCATA TTATTTTTAA TTGTAAATTA TGGATTATTT 
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(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 i near 



5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 





TGATAATGAT 


TCTCATTGTC 


ATACATCACG 


AAGGAGGCTA 


ATTAGTCAAT 


GAATAAAGTA 


60 


10 


ATTAAAATGC 


TTGTTGTTAC 


GCTTGCTTTC 


CTACTTGTTT 


TAGCAGGATG 


TAGTGGGAAT 


120 




TCAAATAAAC 


AATCATCTGA 


TAACAAAGAT 


AAGGAAACAA 


CTTCAATTAA 


ACATGCAATG 


180 




GGCACAACTG 


AAATTAAAGG 


GAAACCAAAG 


CGTGTTGTTA 


CGCTATATCA 


AGGTGCCACT 


240 


15 


GACGTCGCTG 


TATCTTTAGG 


TGTTAAACCT 


GTAGGTGCTG 


TAGAATCATG 


GACACAAAAA 


300 




CCGAAATTCG AATACATAAA AAATGATTTA AAAGATACTA 


AGATTGTAGG 


TCAAGAACCT 


360 




GCACCTAACT 


TAGAGGAAAT 


CTCTAAATTA AAACCGGACT 


TAATTGTCGC 


GTCAAAAGTT 


420 


20 


AGAAATGAAA 


AAGTTTACGA 


TCAATTATCT 


AAAATCGCAC 


CAACAGTTTC 


TACTGATACA 


480 




GTTTTCAAAT 


TCAAAGATAC 


AACTAAGTTA 


ATGGGGAAAG 


CTTTAGGGAA 


AGAAAAAGAA 


540 




GCTGAAGATT 


TACTTAAAAA 


GTACGATGAT 


AAAGTAGCTG 


CATTCCAAAA 


AGATGCAAAA 


600 


25 


GCAAAGTATA 


AAGATGCATG 


GCCATTGAAA 


GCTTCAGTTG 


TTAACTTCCG 


TGCTGATCAT 


660 




ACAAGAATTT 


ATGCTGGTGG 




GAAATCTTAA 


ATGATTTAGG 


ATTCAAACGT 


720 


30 


AATAAAQvAL. 1 




AGTTGATAAT 


GGTAAAGATA 


TTATCCAACT 


TACATCTAAA 


780 


GAAAGCATTC 


CATTAATGAA 


CGCTGATCAT 


A'rrriTGTAG 


TAAAATCAGA 


TCCAAATGCG 


840 




AAAGATGCTG 


CATTAGTTAA 


AAAGACTGAA 


AGCGAATGGA 


CTTCAAGTAA AGAGTGGAAA 


900 


35 


AATTTAGACG 


CaGTTAAAAA 


CAACCAAGTA 


TCTGATGATT 


TAGATGAAAT 


CACTTGGAAC 


960 




TTAGCTGGCG 


GATATAAATC 


TTCATTAAAA 


CTTATTGACG 


ATTTATATGA AAAGTTAAAT 


1020 




ATT^^AAAAC 


AATCAAAATA 


ATTAAGGAGT 


TTTACGATGC 


TACTTAAACC 


AAAATACCAA 


1080 


40 


ATCGTTATTG 


CTGGTTTATG 


TCTTGCAATA 


GTAGCTATCT 


TAAGTTTAAT 


GATTGGAAAT 


X X. *± \J 




ACGCTTGTGT 


CACCAGGTAC 


GGTGATACAG 


GCGTTATTCA 


ACTTTGATAG 


TGAAAACGAT 


1200 




TTACATGATG 


TTGTCACTGG 


TGCACGGGCG 


TCGAGAACAA 


TCATTGCGTT 


ATTGACTGGT 


1260 


45 


GCTGCCCTTG 


CTGTCTCAGG 


TTTGTTGATG 


CAAGCACTTA 


CACGAAACCC 


AATAGCCTCA 


1320 




CCAGGGCTTT 


TCGGTGTCAA 


TGCAGGCGCA 


GTArrri'i'TG 


TCATTTTTAG 


TATTACATTT 


1380 




ATCCAAATTC 


AATCTTTTAA 


AATGATTGTA 


GTTATTGCAT 


TTTTGGGGGC 


TATTGTTGTT 


1440 


50 


ACTGTATTAG 


TTGTTGCACT 


AGGTATGTTT 


AGACAAACAC 


TATTCTCACC 


TCACCGTGTC 


1500 




ATTTTGGCAG 


GTGCTGCGAT 


TGCGATGCTA 


TTTACAGCCT 


TTACTCAAGG 


CATACTTATT 


1560 



55 
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AAAGATATTC ATGTAGATTG CGACAATGAC ACTATTTTAA TTOATCTCAT ACCAAATGGA 
CCAACATGTC ATACAGGCAG TCAAAGTTGT TTCAACACAG AAGTTCCATr TTCAGTGCAA 
ACATTAGCGC AGACAGTTCA AGATAGTGCC CAATCCAATA ATGAAAAGTC ATATACAAAA 
TATTTATTAA CAGAAGGTAT AGAAAAGATT ACAAAAAAAT ACGGTGAAGA AGCTTTTGAA 
GTCGTAATTG AAGCAATTAA AGGTGACAAA AAAGCATTTG TAAGTGAAGT AGCAGATGAA 

10 

CTTTATCATT TATTTGTCTT GATGCATGCG CTTGGCGTCG ATTTTTCAGA AATTGAGGCG 
GAATTAGCGC GTAGACATCA TAAGCX5CAAT AACTTTAAAG GTGAACGACA AAATATCGAA 
CAGTGGTAAA GCAAGTATGG ACTAAGATAT AAGGAAAAGG ATCATGGCTT ATACACTTAC 
AAATATTGTG GAAAACGTGA CATTTTCAAG TTTAAAATAC GACACCAACA TATTTrAACT 
ATGAATGCTG TGATGGTACT AAAGTTGCGA ACTCGTTATA GATAAGTAGT GGATAATCAC 
20 AATACX3AAAT CAkhAATAAT TATAAAAAGT AAATTGAGCA ACTCAGGAAT AGATGTCACT 

GTTAAAGATG TCGAAAAGTA TATGAATCGA TATAATGAAG TTATCAAGGG AAAAAATGGC 
GAAAAAGCTA AAGAGTTATG TTTGTCGTTA CTACCTATTA ATATCATAGT TGTCTTTACA 
2S TTCTTTGTAT TTATACTATA AATACAAATA TATCTAGCCT GAAATAGAAA TGTCATAGCC 

TATTTAAAAG ACAATCTCCA TTAGAACTAA GATATGCATC CCGAAAGTTA GACTAAAAAA 
CTAACTTTAT GGGATGTATT TTTATGCTAA TCATCATAAA TTCGAGATTA AGTTAAAGGT 

30 

AGTTCAAGAG TAATTAAACA ATAAATwAAA AATAGTAGGA TACTTACTTT GAGGGAAGAA 

AATTAACTGT ATATATTTAG TTTAGGAACA AGTATTACGG TTTATCCTGA TACAATTATT 

GTGGATGGGA TGATATTTTT AGGTTTAAAA TACGACACCA GCAAACATAA TAACTGTAAT 

AGCTCATAAA TCTCCCCATA TAGCTAATCT AAAAAAATAA TACATCATTG GAATTAAGCC 

CCAASCATGT AAATATTAAA AATCAAAAmA GATATmTGTA AAAaAGTTAC AATTtGCATA 

ATTAAATTGT GTCTAATTAT TGACTAATTA AATTTTGCCA AATATAATAT TAATTAATAA 

TTTGaAATGA TTAGCGTATA CACTTTAAAT TCTCTTTGGA GAATATATTT TTTAAATACA 

AATGTAAACG CTTTCTCGTC AAATTAAACA ATAGAAAGGA TGGTCATTAT GAGTGCTTGG 

^5 TTAAGTAAAT TATTTGAGTT TATTCCTCGA ATAATTATCA ATTTGTTTAT CTAAAATAAA 

AAAATAGAGG TGCTGACAAT GATGAAAAGT CAAAATAAGT ATAGTATTCG TAAATTTAGT 

GTAGGTGCAT CTTCCATTTT AATAGCTACA TTACTATTTT TAAGTG 

50 (2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3775 base pairs 

55 



>ISOOCIO: «EP 078651 9A2_I_> 



1020 



10 



16 



20 



EP 0 786 519 A2 

AATTGCATAT GCGCAGTATG GGGCTGATAT TCCGGCAATT GTTCAATTTA ACAATTATAT 3 000 

TGGTATTCAA TTCCATCCTG AAAT^GCGG TACATATGGG TTACAAATTT TGCXSTCAGGC 3 060 

AATACAAGGG GGATTTATAA ATGATTGAAT TATGGCCAGC GATTGATTTG ATTGGGTCAA 3120 

CAAGTGTGAG GTTAACAGAG GGTAAATATG ATAGTGAAGA AAAAATGTCA CGCTCGGCTG 3180 

AAGAAAGTAT TGCTTACTAT AGTCAATTTG AATGTGTGAA TCGTATTCAT ATCGTCGACT 324 0 

TGATAGGTGC TAAGGCACAG CATGCCCGAG AGTTTGATTA TATTAAGTCA TTGAGGAGAT 3 300 

TAACAACCAA AGATATTGAA GTAGGTGGTG GCATTCX3TAC GAAGTCACAA ATCATGGACT 3360 

ACITTGCCGC AGGGATTAAT TATTGCATAG TTGGAACGAA AGGTATTCAA GATACTGATT 3420 

GGCTTAAAGA GATGGCACAT ACATTTCCAG GTCGCATTTA TTTATCTGTT GATGCCTATG 3480 

GAGAAGATAT TAAAGTGAAC GGATGGGAAG AGGACACAGA GTTAAATTTA TTTAGTTTTG 354 0 

TGAGACGGTT ATCGGATATA CCTCTTGGCG GCATTATATA TACTGATATT GCTAAAGATG 3 600 

GCAAAATGTC CGGACCTAAC TTTGAATTAA CTGGTCAATT AGTAAAGGCA ACGACGATTC 3660 

CTGTCATTGC TTCCGGTGGT ATTAGACATC AGCAAGATAT TCAACGATTA GCGTCGCTAA 3720 

2S ATGTTCACGC TGCTATTATA GGAAAGGCTG CACATCAAGC ATCrrrTTGG GAGGGGCTAA 378 0 

AATGATTAAA AAACGTATCA TTCCATGTTT AGATGTCAAA GATGGTCGTG TCGTTAAAGG 3B4 0 

GATTCAATTT AAAGGATTAA GGGATATTGG GAATCCTGTT GATTTAGCAA TGTATTACAA 3900 

TGAAGCGGGT GCTGATGAAT TAGTATTTTT AGACATCTCT AAGACGGAAG AGGGTCATAG 3 96 0 

CTTAATGCTA GAAGTGATTG AACAGACAGC GTCACGCTTG TTTATCCCTC TTACTGTAGG 4 02 0 

GGGTGGGATT CAAAGTCTCG ATGATATTAC CCAATTGCTA AATCATGGTG CAGATAAAGT 408 0 

ATCATTAAAT TCAAGTGCTT TAAAAAATCC ACAGCTCATT AAACAAGCGA GTGATAAATT 414 0 

CGGT^CAA TGCATCTGCA TAGCAATTGA TAGCTATTAT GATCCTGAAA GAAAAGCACA 4200 

TTATTGTTGT ACGACTGGTG GTAAAAAAAT GACAAATATT AAAGTATATG ACTGGGTACA 4260 

GCAAGTAGAA CAGTTAGGTG CAGGTGAGCT CCTCGTTACA AGTATGGGAC ATGATGGTAT 4320 

GAAACAAGGC TTTGATATTG AACACCTAGC AAATATTAAG TCTCTTGTAA ATATTCCAAT 4380 

CATTGCTTCT GGTGGTGGTG GCAATGCACA ACACTTTGTA GAATTATTTG ATCAGACGGA 444 0 

TGTTTCTGCA GGTTTAGCTG CAAGTATATT ACATGATCGA GAAACGACGG TTCAATCTAT 4500 

TAAAGAAGTG ATACGGCAAG GGGGTATAGC AGTAAGATGA CCAAATATAA AATTGATTTT 4 560 

AGCAAAGGTT TAGTGCCAGC AATTTTACAA GATAATCAAA CAAAACAAGT ATTGATGTTG 4 620 

GGTTATATGA ACCAAGAAGC TTTTGATAAA ACGATAGAAG ATGGTGTGGT ATGTTTCTAT 4680 
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GATTCAAAAG TTAATGCTGA TCATGCCAGA AGGTCCGGCA TTAACGCTAA ATCCTCATTT 
TTTTATGTAT CAAGCATATG CGGCACAAGT AAATCGTGAA ATTGCATTTG TAGATGCAGG 
* ATCAGATTTA ACGTTTGATT TGGAAACCAT TTTAACX3AAA ATCX3ATGAAG TACAACCATC 

ATTTTTTATT ATGAGTAATC CACATAACCC TTCAGGCAAG CAATTTGATA CGGCAnTTT 
AACAGCTATT GCAGATAAGA TGAAAGCATT AAACGGATAC TTTGTCATTG ATGAAGCATA 
TTTAGATTAT GGTACGGCAT ATGACGTGGA ACTGGCACCA CACATCTTAA GAATGCGTAC 
ATTATCAAAG GCGTTTGGAA TTGCCX3GCTT AAGATTAGGT GTCTTAATTA GTACTGCTGG 
AACGATAAAG CATATTCAAA AAATAGAACA TCCATATCCA TTAAATGTAT TTACGCTAAA 
TATTGCGACT TATATTTTTA GACATAGAGA AGAGACAAGA CAATTTTTAA CGATGCAACG 
ACAGTTAGCT GAGCAGTTAA AACAAATATT TGATACACAT GTTGCAGATA AAATGTCAGT 
20 GTTCCCATCA AATGCTAATT TTGTACTTAC TAAAGGCTCA GCAGCGCAAC AATTAGGACA 

ATACGTATAT GAACAAGGAT TTAAACCTCG CTTTTATGAT GAGCCGGTGA TGAAGGGCTA 
TGTAAGATAC TCAATTGCAA CAGCATCACA GTTAAAGCAA ITAGAAGAAA TTGTTAAAGA 
ATGGAGTGCA AAATATGATT TATCAAAAAC AACGAAACAC AGCTGAAACG CAACTAAATA 
TTTCAATATC TGATGATCAG TCACCATOGC ATATTAATAC AGGTGTGGGC TTTTTAAATC 
ATATGTTAAC CTTGTTTACA TTTCATAGCG GTCTGTCATT AAACATTGAG GCACAAGGTG 
ATATTGACGT AGATGATCAC CACGTAACTG AAGATATCGG CATTGTCATT GGCCAATTGT 
TACTTGAAAT GATTAAAGAT AAAAAGCATT TCGTTCGTTA TGGAACGATG TACATTCCAA 
TGGATGAAAC ATTAGCACGT GTCGTTGTGG ATATAAGTGG GCGCCCATAC CTATCATTCA 
ATGCATCATT AAGTAAAGAA AAAGTTGGTA CGTTTGATAC GGAGTTAGTA GAAGAATTTT 
TTAGAGCGGT CGTAATCAAT GCAAGATTAA CAACGCATAT TGATTTAATT CGTGGAGGTA 
ATACACAGCA TGAAATTGAA GCTATATTCA AAGCGTTTTC CCGTGCATTA GGCATAGCGC 
TAACTGGAAC TGATGATCAG CGTGTGCCGT CATCGAAAGG TGTGATTGAA TGATTGTCAT 
CGTTGATTAT GGATTAGGGA ATATTAGTAA TGTAAAACGC GCTATTGAAC ATTTAGGGTA 
45 TGAGGTGGTT GTCTCAAATA CCTCAAAAAT AATCGATCAA GCAGAAACAA TCATATTGCC 

CGGTGTCGGC CATTTTAAAG ATGCGATGTC AGAGATAAAA CGATTAAATC TCAATGCAAT 
ATTGGCTAAG AATACTGATA AGAAGATGAT TGGTATTTGT TTAGGCATGC AATTAATGTA 
TGAGCATAGT GATGAAGGCG ATGCATCTGG ATTAGGGTTT ATCCCAGGAA ATATTTCGCG 
TATCCAAACA GAATACCCAG TGCCACACTT AGGCTGGAAT AATTTAGTGA GTAAGCACCC 
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TCCAATG CTT 


GTTGGCGATA 


TGGCATATGA TGGAATTGCA 


CACGATCCTT 


TGTCAATACA 


7260 




CCTAGCGGAC 


CATGATTTGG 


ACTTTGAAAT GTGTTTGTAT 


TAGACGTATG 


TGTTTTGGTA 


7320 




ACATTACGCG 


CCGTATGAAT 


TTCATCATTA AaTACAACCA 


TCACGCCTTT 


ATGACGGGCC 


7380 




TTTTCATCAG 


AGGCAACGCG 


AATAGCGGAA ATATAATTAT 


ATAATCCGTC 


AGAACCAATT 


7440 




TCATTAGACG 


AGCGCATTGG 


CCAAGAAnTA ACAACAGGTT 


G 




7481 


10 


(2) INFORMATION FOR SEQ ID NO: 23 7: 








15 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








20 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 








ATGAATTATA 


CCTTCCACTA 


GAAGTGTCGG lAl 1 




AAGCAAGTTA 


60 




TCCTTCAACG 


^i^n^^n^n^ ^^^^^^ 

GTATTAATGA 


CAGCGACGTT AQj w4.L-AA^j i A 




AAAATATTGT 


120 


25 


CGTTGTGACA 


CCACCTCAAC 


CTAATGGTGT GTCTCAAOAA 


uXAx XAvjn— AvJ 


CATGCTATAT 


180 




TACGCAAGTT 


GATCAAGTAT 


TTCAAGTTGG TGGTGCTCAA 


AGTATTGCTG 


CGTTAACTTA 


240 




TGGCACAGAA 


ACGATACCTA 


AAGTTGATAA GATTGTAGGT 


CCAGGCAATC 


AATTTGTTGC 


300 


30 


ATATGCCAAG 


AAATATTTAT 


TTGGACAAGT AGGTATTGAC 


CAAATAGCTG 


GACCAACAGA 


360 




AATAGCACTT 


ATTATTGATG 


AAACAGCAGA TTTAGATGCC 


ATCGTATATG 


ATGTTTTTGC 


420 




ACAAGCAGAA 


CATGATGAAT 


TAGCACGTAC ATATGCCATT 


AGTGAAGATG 


CGCAAGTCCT 


480 


3S 


TAAAGATTTA 


GAATCACGTA 


TTGCTAAAGC ATTGCCTAAT 


GTGGACAGAT 


ACGACATTGT 


540 




TTCTAAAAGT 


ATCGCTAATC 


AACACTACCT TATCCATGCT 


AGTAATTTTG 


ATGAAGCATG 


600 


40 


CCATGTCATG 


AATACAATCG 


CGCCTGAACA TGCGTCGATT 


CAAACAGTAA 


ATCCTCAACC 


660 


ATATATTGAA 


AAAGTGAAAT 


ATGTGGGTGC ATTGTTTATT 


GGACATTATT 


CGCCAGAGGT 


720 




CATAGGAGAT 


TACGTTGCAG 


GTCCAAGTCA TGTATTACCT 


ACAAATAGAA 


CAG CT AGATT 


780 


45 


TACCAATGGG 


TTATCGGTCA 


ATGATTTCTT AACACGGAAC 


ACGGTCATCC 


ATTTATCAAA 


840 




AGATACGTTT 


GAACAAATTG 


CTGATTCAGC ACAACATATT 


GCTCATGTTG 


AAGCATTATA 


900 




CAATCACCAG 


CAGTCTATTT 


TAATACGTCA GTCTTAGGGG 


AGTGTAATTG 


AAATGATTTA 


960 


50 


TATTGATAAA 


AATGAAAGTC 


CAGTTACGCC GTTGGATGAA 


AAAACAATGA 


CGTCTATTAT 


1020 




TAGTGCAACg 


CnATATAATT 


TATATCCTGA TGCAGCATAT 


GAACAATTCA 


AGGAAGCTTA 


1080 
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TAAATTGCGC TTATAAGTAT GTAGCGGTTT TTTCATTTTT CAAAGTTTCT TATTTAACAA 
GGTCTTGTCr CGAATATTGG CATATCAATT TAACTTTTTA AATAGTCATC PAPJ^GATAA 
AACACCACAA TCAACAAATT TAACGAGGAA GAATAAAAAA TAAATCAACA TATTAAATTG 
TAGTGTTATT CAACTCCGTA GCTAACAATT CTCTATTCAC ATTAAACAAA TTGTCAAAAA 
TATATCATAA ATCTTCAAGC ACAGACTTAG CGCATCAATC ACTGAACTGT TATAATAGTT 
TGGGATTAAA GGAGGCCGAA ACAATGCAAA AAGTTGAAAG TATCATAATT GGTGGAGGGC 
CATGCGGATT AAGTGCGGCT ATTGAACAAA AAAGAAAAGG TATTGATACC TTAATTATTG 



5460 
5520 
5580 
5640 
5700 
5760 
5820 

AAAAGGGTAA TGTCGTTGAA TCAATCTACA ATTATCCTAC TCACCAAACA TTTTTCTCAT 58 80 

CAAGTGATAA ATTAAGTATT GGGcGAgTAC CGTTTATCGT TGAAGAAAGT AAACCAAGAC 594 0 

GTAATCAAGC GCTAGTTTAT TACCGAGAAG TTGTAAAACA TCATCAATTA AAAGTAAATG 6000 
^0 CATTTGAAGA AGTATTAACT GTTAAAAAAA TGAATAATAA ATTTACTATT ACTACGACGA 6060 

AAGATGTTTA TGAATGTCGA TTTTTAACAA TCGCGACAGG CTATTATGGT CAGCATAATA 6120 
CATTAGAAGT TGAAGGTGCG GATTTACCTA AAGTGTTCCA TTATTTTAAA GAGGCACATC 6180 
CGTATTTTGA TCAAGATGTT GTAATTATCG GTGGTAAGAA TTCX5GCTATC GATGCTGCTT 6240 
TGGAGTTGGA AAAAGCTGGT GCTAACGTGA CGGTTCTATA TCGTGGTGGA GATTATTCGC 6300 
CTTCAATTAA ACCGTGGATA CTTCCAAATT TCACAGCATT AGTAAATCAT GAAAAAATTG 6360 ' 

ACATGGAATT TAATGCTAAT GTTACCCAAA TAACTGAAGA TACTGTGACT TATGAAGTAA 64 20 

ATGGTGAAAG TAAAACGATA CACAATGATT ATGTATTTGC GATGATTGGT TATCATCCCG 64 80 

ATTATGAATT TTTAAAATCT GTAGGCATTC AAATTAATAC AAATGAATTT GGAACAGCGC 654 0 ' 

CTATGTATAA TAAAGAAACA TACGAAACAA ATATCGAAAA TTGCTATATT GCAGGTGTAA 6600 * 

1TGCTGCAGG GAACGATGCG AATACCATTT TTATTGAAAA TGGTAAATTC CACGGGGGCA 6660 
TTATTGCTCA AAGCATGCTA GCTAAGAAAC AAACGCCCTT AGAATCATAA AAATAAAGGT 6720 
CTATGTACTA AATAACTTAG TTTTACAACG ACTGACATTC ATGATATGTC AAATGAGGTT 6780 
GATGACTATT GATTGTACAT AGACCTTTTT ATGTTACGTA TTCATTATAA TTCAAAATAT 684 0 

45 GATTTGATTT CAGCTTTATC TAAATTGTTG CTTAACGCGA CTAATAATTT TAATCTTGCT 6900 

TTTGGACCAT TCAAGCCGTT AGAAAAAATA AAACCTTGTT GTGCGAGTTG GTAACCACCA 6 96 0 

CCATCGTATG CGTAAGTTGG ACTCACAATA CCATTAAAGG AACGTGAAAC TAGCACAATA 7020 
GGTATATTTA AAGATACTAA TTGTTGAATG CCTTCTAATG CGCTTGGAGG TATGTTGCCT 7 080 

TGTCCTAACG CTTCAATAAC CATACCATCG ATACCTTCTC GACTATAAAA ACTAAAAATG 7140 
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ATGTTGTGAC AATGATTCTA ATAT/IACTGA TATCGCAATT TTAAATAAGA AGAAGGTAAT 3660 

TAGAAGTATT GGATTTGATG AAAAGTTGCA AAATTTATTT CTCAGATAGT ATTACTTTAC 3720 

TAAAAGAAAA TTGACAAGCT ATAATTAGTG TATACACAAT TGAAAAATGA TTGAAATAAT 3780 

TTTGAAAAAT ATACATAAAC ATATGTCATG TGGGTATATT TTATGTAAAA TCATTGTAAT 3 84 0 

AGAATAGAAA GGAAGATGGC TATGTCTAAT AATTTTAAAG ATGACTTTGA AAAAAATCGT 3 900 

CAATCGATAG ACACAAATTC ACATCAAGAC CATACGGAAG ATGTTGAAAA AGACCAATCA 3 960 

GAATTAGAAC ATCAGGATAC AATAGAGAAT ACGGAGCAAC AGTTTCCGCC AAGAAATGCC 4 020 

CAAAGAAGAA AAAGACGCCG TGATTTAGCA ACGAATCATA ATAAACAAGT TCACAATGAA 4080 

TCACAAACAT CTGAAGACAA TGTTCAAAAT GAGGCTGGCA CAATAGATGA TCGTCAAGTC 4140 

GAATCATCAC ACAGTACTGA AaGTCAAGAA CCTAGCCATC AAGACAGTAC ACCTCAACAT 4 200 

^0 GAAGAGGAAT ATTATAATAA GAATGCTTTT GCAATGGATA AATCACATCC AGAACCAATC 4260 

GAAGACAATG ATAAACACGA TACTATTAAA AATGCAGAAA ATAACACTGA GCATTCAACA 4320 

GTTTCTGATA AGAGTGAAGC TGAACAATCT CAGCAACCTA AACCATATTT TACAACAGGT 43 80 

GCTAACCAAT CAGAAACATC AAAAAATGAA CATGATAATG ATTCTGTAAA ACAAGATCAA 4440 

GATGAACCTA AAGAACATCA TAATGGTAAA AAAGCAGCAG CTATTGGTGC TGGAACAGCA 4 500 

GGTGTTGCAG GTCCAGCTCG TGCAATGGCT GCTTCTAAAG CTAAGAAACA TTCAAATGAC 4 560 

GCTCAAAACA AAAGTAATTC TGGCAAGGCG AATAACTCGA CTGAGGATAA AGCGTCTCAA 4 6 20 

GATAAGTCTA AAGATCATCA TAATGGCAAA AAAGGTGCAG CGATCGGTGC TGGAACAGCA 4 6 60 

GGTTTGGCTG GAGGCGCAgC AAGTAAAAGT GCTTCTGCCG CTTCAAAACC ACATGCCTCT 4740 

AATAATGCAA GCCAAAACCA TGATGAACAT GACAATCATG ACAGAGATAA AGAACGTAAA 4 800 

AAAGGTGGCA TGGCCAAAGT ATTGTTACCA TTAATTGCAG CTGTACTAAT TATCGGTGCA 4860 

40 TTAGCGATAT TTGGAGGCAT GGCATTAAAC AATCATAATA ATGGTACAAA AGAAAATAAA 4 920 

ATCGCGAATA CAAATAAAAA TAATGCTGAT GAAAGTAAAG ACAAAGACAC ATCTAAAGAC 4 980 

GCTTCTAAAG ATAAATCAAA ATCTACAGAC AGTGATAAAT CAAAAGAGGA TCAAGACAAA 504 0 
''^ GCGACTAAAG ATGAATCTGA TAATGATCAA AACAACGCTA ATCAAGCGAA CAATCAAGCA 5100 
CAAAATAATC AAAATCAACA ACAAGCTAAT CAAAATCAAC AACAGCAACA ACAACGTCAA 5160 
GGTGGTGGCC AAAGACATAC AGTGAATGGT CAAGAAAACT TATACCGTAT CGCAATTCAA 5220 
TACTACGGTT CAGGTTCACC GGAAAATGTT GAAA^AATTA GACGTGCCAA TGGTTTAAGT 5280 
GGTAACAATA TTAGAAACGG TCAACAAATC GTTATTCCAT AATATAACTA TATAAATTGT 534 0 
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AATGAACTAC ATAACrTGTT TAAAGCAATA ACTTTAAAAG GGCCATGTTA CTTACATTAT i860 

TATTTGCAAG GCTATGATGA ACCAATGTAT ACGAGACAGC AAGTTAGTTT AATAGAAAAG 1920 

CTATCTCAAC AGCAATTGTT TGAATACGAA ATGAATAATT TAGTGACAAT GATGTTTGAA iggQ 

TTAGAAAGTG GAGAATATAC TATTTTATCA AAAATAATAA TGAAACCTAC ATTATTAAAT 204 0 

CAAACTTATA TTACTTATAC AAAATTGCTT GAACAATTCA CGATGGAAGA TATAGCGGCT 2100 

CAACAACAAG TTAAAATCAA TACTATCGAA GATCATGTAC TTGAAATCTT AATCAAAGGT 2160 

TACATGTCTA ATTACGATGA TTATGTTGAA CTAGAAGATC AACTCCAGTT TTTGAATTTT 2220 

TATCAACAGC ATCGTGGCGA ACGATTAAAA TTTTACAAAG AACAATTTGA CACGTTATCA 2280 

TATTTTCAAT TAAAAGTATT AATCGTTGGA TTTGAAAGAG GTGATCTGAA TGTTGCATGA 234 0 

TATTTTACGA AACAAATTTG GATTCGAGAG TTTTAAACCG GGACAACAGG AAATTATAGA 2400 

20 AAGTATAATG TCTCAACAAC ACACTCTAGG TATACTTCCA ACTGGAAGTG GAAAGAGTTT 2460 

GTGTTATCAA ATACCTACGT ATTTATCAGG TAAGCCGACA TTAATTATCT CACCGTTAAT 2520 

ATCTTTAATG GATGACCAAG TTATGCAGTT GAAAATAAAT GGAGAAAAAC GTGTAACATG 2580 

TATTCACTCT GGTATGGATG AAATTGAGAA AAAGCATAAT ATTAAATGTT TACGACATAG 264o' 

CCGCTTCATC TTTCTAAGTC CAGAATTTCT CCTGCAACCG TCAAATTTTA AATTAATATC 2700 

TATGATAGAC TTTGGCATGA TTGTTCTAGA TGAAGCACAT TGCCTATCTG AATGGGGATA 2760 

TGATTTCAGA CCACATTATG CTCTAATAGG AAAAGTAACA AAGCATTTTA AAGAAGCGGT 2 820 

TGTCTTAGCA TTGACAGCAA CTGCACCACC GCATTTACAA GATGATTTGA CGGAAATGTT 2 880 

AGCGATTCAA TTCAATGTTA TTAAAACTAC AATGAATCGC CCAAATATAA GCTTTAAGCA 2 940' 

TCTTAATTTT CATGATGATG AAGATAAAAT TGAATGGTTG CTGCCGTTTC TACAACAGTC 3000 

GGGAGCAACG ATTATTTATG TCTCATCGAA AAAGATGTGT CTGAATTTAG CX3CAACTTAT 3060 

40 TTATGATTCA GGTTTTCTTA CAGGTATTTA TCATGGTGAT ATGAATTATC AAGAGCGACA 3120 

CACAGTTGAA CAACAATTTT TAT^TAATGA TATTCCGATT ATAGTCGCAA CGAGTGCTTT 318 0 

TGGTATGGGA ATTAATAAAA AAGATATTCG CACAATCATT CACTTTCATC TTTCAACAAG 324 0 

TCCTTCTAAC TACATTCAAG AAATTGGCCG TGCGGGTCGC GATGGTGAAC TAAGTCAGGC 3300 

AATTAGTTTA TTCCAACCGG ACGATAAATA TATTTTAGAA ACGTTATTAT TTGCAGATAT 33 60 

GATAACAGAA GAAGATGTAC AAAATTTCGA AATAGGAGAA TTTTTAGCTC CCGATAAACA 34 20 

AGCCGTTTTG ACAACGTTGc AATCATTCTA TAGTATCGGC GCCTTGaAAC AGATATTTAA 34 80 

GCAATCATTT AAACGAAAGC AATTAGGATT CTTTCGCATG ATTGGCTATT GCAAATTGGA 354 0 
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TGAGTGATAG AATCAAAAAA GCCATCTCAA AAATTAATCA AGCAAACAAC ATTCCAAACA 60 

ATGsTCGCAA ATCACCAATG TATCACTCTC CAATTACGTA ACTATGATTT AATTTAAGCA 12 0 

TAGTTATTGA GGTTTTGTGA TATATAGTAT AAAATTAATG AGAATTAAAT TTAATAATGT 180 

AAAATTCATm TTCgGGGTCG GGTGTAATTC CCAACCGGCA GTAAATAAAG CCTGCGACCT 24 0 

GCTAGTATGT ATCATATTAG TGGCTGATCT AGTGAGATTC TAGAGCCGAC AGTATAGTCT 3 00 

GGATGGGAGA AGATGGAGGT TTTTTGTTGT GCAATAATCC TCCTATTCTT ACGAGATGAA 3 60 

TGGAAGGAGA AAATTGAATA TGCAACAAAA TAAACGTCTT ATCACAATAA GTATGTTGAG 420 

CGCGATTGCG TTTGTGTTAA CTTTTATCAA GTTTCCTATA CCATTTTTGC CACCATACTT 4 80 

AACTTTAGAT TTTAGTGATG TACCGTCACT ACTAGCTACA TTTACGTTTG GACCAGTTGC 54 0 

CGGTATCGTA GTTGCACTGG TTAAAAATTT ATTGAACTAC TTATTTAGTA TGGGCGATCC 600 

20 AGTTGGACCA TTTGCTAACT TTTTAGCAGG CGCAAGTTTC TTATTAACTG CTTACGCCAT 660 

CTATAAAAAT AAACGTTCAA CAAAATCTTT GATTACTGGA TTAATCATTG CAACAATCGT 720 

TATGACTATC GTGTTGAGTA TTTTGAACTA TTTCGTTCTA TTACCTTTGT ACGGTATGAT 780 

ATTTAACTTA GCTGATATCG CAAATAATCT TAAAGTAATC ATTGTTTCAG GAATTATACC 84 0 

ATTCAATATT ATTAAAGGTA TCGTTATTTC TATTGTATTT ATTTTACTAT ATAGAAGGCT 900 

TGCGAATTTC TTGAAAAGAA TTTAATCAAA TTAAAGCAAA ATAATATACA CATAATAATA 96 0 

AAAAGCAGGT GACTATCAAT AAACGATAGC TTGCCTGCTT TTTCTATAGA ACATTTGTCT 1020 

AAAAAATCAA TTATTCaAAT TTTAAAGCGT CCCCATCAAA TGATTCGTCT GCAATTTTAA 1080 

TAGAATCTGt AGGGCATCCa TCAATTGCAT CTTCCATATC TTCATATAAT TCCtCAGGTA 114 0 

CTTCTGCAGT ACCTTGGTTA TCGTCAAGGA TTACGAAAGC AATACCTTCG TCGTCGTAAT 1200 

CATMrATATC TGGCGCTGCT GCACCGcATG CACCACATGC AATACAAGTA TCCATATCAA 1260 

40 CGATTGTATA TTTTGCCAAT GTCTTCX3CCT CCTTTGATAA AAATGCTAAA ATAGTAATGT 1320 

GACTAAAATT TTAGACAGCA TCATTTTTAT TTTCAAATTA TCCGTTTTAC AGAGTGAGGG 13 80 

TTAAATTTGC AACACATTAT AAAAACAGCA TTACAACAAA CATTTAACTA TAAAACAAAT 1440 

AAAAGTATTT ACAATATCTT AGTTGGTAAG AAATCTCACC AAACCTTTTT TGACGCTTGT 1500 

AGTCAACAAC AGTTGTCATT ATATCACAGT TTACCACTAT TAAAATATCC GTCTTTTGAG 1560 

CTATTTtTAG AAAAAATCAA TGAATTTAAT GCTGAAATGG AAATCATGTT GCATCCTAGA 1620 

SO 

TATACATTTG aAAGCATGGG GCAAACATTT CAAGCAATTC AACTATTAGT GCAAaCCATG 1680 

TCTAATACCA AACAACATGT TTTTCATTTT GTACCAATCT CTCAAAATAA TAAGATACAm 1740 
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AAAGCGAATT AGATAGAGTA GATAAAAAGC TCTCTAATGA AAACTTTGTA AGTAAAGCAC 
CTGAAAAGGT TATAAATGAA GAAAAACGTA AAAAACAAGA TTATCAAGAA AAATATGATG 
GTGTGAAGGC AAGAATTGAA CAATTAAAAG CATAGGAGTT AGTAACAATG AATTACCTAG 
AGAGCTTGTA TTGGATACAC GAAAGAACTA AATTTGGCAT CAAACCAGGT GTTAAACGTA 
TGGAATGGAT GCTAGCACAA TTTAATAATC CTCAAAATAA CATTAAGGGT ATTCATGTAG 
GTGGCACAAA TGGTAAAGGC TCTACAGTTG CTTACCTTAG AACAGCTTTA GTTGAAAATG 2460 
GTTATGAAGT AGGTACATTT ACGTCGCCGT TTATTGAAAC ATTTAATGAA CGAATTAGTC 
TAAATGGTGT GCCAATATCA AATGACGCTA TTGTAGAATT AGTATCACGT ATTAAACCAG 
TAAGTGAAAT GATGGAACGT GAAACAGATT TAGGTGTTGC AACTGAATTC GAAATAATCA 
CAGCGATGAT GTTTTTATAT TTTGGTGAAA TACATCCTGT TGATTTTGTC ATTGTTGAGG 2700 
CTGGATTGGG TATAAAGAAC GATTCGACAA ATGTCTTTAC ACCGGTTTTA TCAATCTTAA 276G 
CTAGTATCGG TCTAGACCAT ACAGATATTT TAGGTGGTAC TTATCTAGAT ATTGCTAGGG 
ATAAAGGCGC GATTATAAAG CCTAACGTTC CAGTGATATA TGCTGTTAAA AATGAAGATG 
CATTAAAATA TGTTCX3TGAA CGCGCAATTG AACAACATGC AAAGCGAATT GAATTAGATA 
GAGAAATTGT TGTTGTATCG CAAAATGATG AATTTACTTA CCGTTATAAA GATTATGAAT 3000 
TAGAAACAAT CATTTTAAGC ATGTTAGGTG AACATCAGAA ACAAAATGCT GCATTAGCCA 3060 
TAACAGCTCT TATTGAATTA AATGAACAAG GATTAATTGA ATTAGATTTC AATAAGATGA 312 0 

TAGACGGTAT TGAATCAGTT CGTTGGACTG GACGTATTGA GCAGGTGCAT GACAAACCTT 3180 
TAATCATTTT GGATGGCGCA CATAATTCAG AGAGTATAGA TGCTCTAATT GATACAATTA 324 0 

AACAGTACCA TGATAAAGAA AAAGTAGATA TTTTGTTCTC AGCAATAAAC GGAAAACCGA 3300 
TTAAfiSAGAT GGTCAAACAT TTAAGTTTAA TTGCGCATAC GTTTTATGCA ACTGAATTTG 3360 
40 ATTTTCCGAA AGCGTTACGC AAAGAAGAAA TTGTAGGTAG TATTGAAAAT GATGAAATAC 3420 
Mi-TTAGTAGA TGACTACGTT GAATTTATAA AAAATTATCA AGGTGATACA TTAGTAATTA 3480 
CCGGTAGTCT GTATTTCATA AGTGAAGTTA AATCAA 
(2) INFORMATION FOR SEQ ID NO: 236: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 81 base pairs 

(B) TYPE: nucleic acid 
50 <C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2820 
2880 
2940 



3516 



55 



NSOOaD: <EP 078651 9A2_I_> 



1012 



10 



IS 



20 



EP 0 786 519 A2 

TACAGCGATT GTTGTTAACC CTAATGACGA ACGATACAAA GATGTAATCG GTAAAACTGT 360 

TATATTACCA ATCGTAGGAC GCGAACTGCC TATTTTAGCA GATGAGTATG TTGATATAGA 420 

CTTCGGTTCT GGTGCTATGA AAGTGACACC AGCACATGAC CCTAATGATT TTGAAATTGG 480 

TCAAAGACAT CAATTAGAAA ATATTATCGT TATGGATGAA AATGGTAAAA TGAACGACAA 540 

AGCGGGTAAA TATGAAGGTA TGGACCGTTT TGATTGTCGT AAACAGCTAG TTAAAGATTT 600 

AAAAGAACAA GATTTAGTTA TCAAGATTGA AGATCATGTT CATTCTGTAG GTCATTCAGA 660 

ACGATCTGGC GCTGTTGTTG AACCATATTT ATCAACACAA TGGTTTGTGC GCATGGAAGA 720 

CTTAGCGAAA CGTTCATTAG ATAACCAAAA AACAGATGAT CGTATTGATT TTTATCCGCA 780 

ACGTTTCGAA CATACATTTA ACCAATGGAT GGAAAATATT AGAGATTGGA CGATTTCAAG 84 0 

ACAATTATGG TGGGGTCATC AAATTCCGGC TTGGTATCAT AAAGAAACAG GCGAAATATA 900 

TGTTGGAGAA GAAGCGCCAA CTGATATTGA AAATTGGCAA CAAGATGAAG ATGTATTAGA 960 

TACGTGGTTC TCaAGTGCTT TATGGCCtTT CTCyACGTTA GGTTGGCCTG ATTTAGAAAG 1020 

TGAAGACTTT AAACGATACT ACCCAACAAA TGCCTTAGTT ACAGGTTACG ATATTATCTT 1080 

25 TTTCTGGGTA GCACGCATGA TATTCCAAGG CTTAGAATTT ACAGATCGTC GTCCATTTAA 114 0 

TGATGTATTA TTACACGGTT TAGTTCGTGC TGAAGACGGG CGTAAGATGA GTAAATCATT 1200 

AGGTAATGGT GTGGATCCAA TGGATGTTAT TGACGAATAC GGTGCTGATA GCTTGCGTTA 1260 

30 CTTCTTAGCA ACAGGTTCAT CTCCAGGACA TGATTTAAGA TACTCAACTG AAAAAGTTGA 1320 

GTCAGTGTGG AACTTTATCA ATAAAATCTG GAATGGGGCA CGTTTCAGTT TAATGAATAT 1380 

CGGTGAAGAC TTTAAAGTTG AAGATATCX3A TTTAAGTGGT AACTTATCAT TAGCAGATAA 144 0 

ATGGATTCTA ACACGTTTAA ATGAAACGAT TGCAACAGTT ACTGATTTAA GTGACAAATA 1500 

TGAATTCGGC GAAGTTGGAC GTGCATTATA TAATTTCATT TGGGATGATT TCTGTGATTG 1560 

GTACATTGAA ATGAGTAAAA TTCCAATGAA TAGTAATGAT GAAGAACAAA AACAAGTTAC 1620 

ACGTTCAGTA TTGAGTTATA CTTTAGACAA TATTATGAGA ATGCTACATC CATTCATGCC 1680 

ATTTGTAACA GAGAAAATAT GGCAAAGTTT ACCACATGAA GGTGACACAA TTGTTAAAGC 174 0 

TTCATGGCCA GAAGTGCGTG AATCATTGAT TTTTGAAGAA AGTAAACAAA CAATGCAACA 1800 

ACTTGTTGAA ATCATTAAAT CTGTAAGACA ATCACGTGTA GAAGTAAATA CGCCATTGTC 1860 

TAAAGAAATA CCTATTTTAA TTCAAGCTAA AGATAAAGAA ATTGAAACAA CTTTATCACA 1920 

AAACAAAGAT TATTTAATCA AATTCTGTAA TCCTAGTACC TTAAATATTA gCtGACGTGG 1980 

AAAwTCCTGA GAAAGCAATG ACaTCAGTTG TAATTGCAGG TAAAGTGGTA TTACCATTAG 204 0 
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TTCATGGGCC ATTAAATGCT CTTATTAATT ATGATTATGT ACATACTATG CAACAGGCCA ' 3 66o 

TAGACAAGCG TATCTCGAAT CCATACTTGC GACAAATGTT AGGCTATTTT ATCAAATATG 3720 

TAGGTTCTTC ATCATACGAT GCGCCAgCTG TATTATCTAT GTTATTCCAT ATGCAACT^G 378 0 

AGCAAGGCCT TTGGTATGTA GAAGGTGGAA TCCATCATTT AGCCAATGCC TTGGAAAAGc 384 0 

tAGCGCGTGA AGAAGGTGTC ACAATTCATA CAGGTGCACG TGTGGACAAT ATTAAAACAT 3900 

ATCAAAGACG TGTGACGGGT GTCAGATTAG ATACAGGTGA GTTTGTAAAG GCAGATTATA 3 96 0 

TTATTTCAAA TATGGAAGTC ATACCTACTT ATAAATATTT AATTCACCTT GATACTCAAC 4 02 0 

GATTAAACAA ATTAGAGAGG GAATTTGAGC CGGCAAGCTC AGGATATGTG ATGCATTTAG 4 08 0 

GTGTTGCTTG CCAATACCCG CAATTAGCAC ATCATAATTT CTTTTTTACG GAAAATGCTT 414 0 

ATCTCAATTA TcAACAAGTT TTTCATGAAA AGGTATTGCC AGATGATCCG ACCATTTATC 4200 

20 TAGTAAATAC GAATAAPiACT GATCACACAC AAGCGCCAGT AGGTTATGAA AATATCAAAG 426 0 

TCTTACCACA TATTCCATAT ATTCAAGATC AGCCTTTTAC CACTGAAGAT TATGCGAAGT 432 0 

TTAGGGATAA AATTTTGGAT AAATTAGAAA AAATGGGACT TACTGATTTA AGAAAACACA 43 8 0 

TTATTTATGA AGATGTTTGG ACACCGGAGg ATATTGAAAA AAATTATCGT TCTAATCGTG 444 0- 

GTGCAATATA TGGTGTTGTA GCAGATAAAA AGAAAAACAA AGGATTTAAA TTTCCTAAAG 45 00 

AAAGTCAGTA TTTTGAAAAC TTGTACTTTG TAGGTGGATC AGTAAATCCT GGTGGTGGCA 4 56 0 

TGCCAATGGT TACATTAAGT GGGCAACAAG TCGCAGcAAg ATAAACGCGC GAGAAGCGAA 462 0 

GAATAGGAAG TGATATCTAT G7UVATGGTTA TCACGAATAT TAACAGTAAT AGTGACCATG 4 6 80 

TCTATGGCGT GTGGTGCaTT GATATTTaAT CgTAGACATC A 4721 
(2) INFORMATION FOR SEQ ID NO: 235: 

r (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TAITCGTGCG CAATGGGCTA AATTAGGTCT AGGTTTAGAT TATAGTAGAG AACGTTTTAC 60 

TTTAGATGAA GGTTTAAGTA AAGCAGTTAA AAAAGTTTTT GTTGATTTAT ACAATAAAGG 120 

AATTATTTAT CGTGGCGAAC GTATTATAAA TtGGGATCCn AAAGCACGTA CAGCTTTATC 180 

TGaTATTGAA GTAATACATG AAGATGTTCA AGGTGCGTTT TATCATTTTA AATATCCTTA 240 
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GCCCATGAGA CAATTTTACT TGCTTTTCCC ATTGGTTATC ACGTTCTTAT TACATAGATT 
TAAACCGAGA AATATTATTC AAACGCTATT TATTGTATCG TTGATTTCTT TAGGACTTAT 
GATAGTGATT CATTTCATCA CTGGAGATAA TTCACGTGTG TATTTTGGGA CAGATACACG 
ACTGCAAACT TTATTGCTTG GTTGTATATT AGCATTTATT TGGCCTCCGT TTGCTTTGAA 
AAAAGATATT TCTAAAAAGA TTGTCGTATC ATTAGATATT ATAGGGATAT CTGGTTTTGC 
GGTTCTAATG ACTTTGTTCT TTATAGTTGG AGACCAAGAT CAATGGATCT ATAATGGAGG 
ATTTTACATT ATATCATTTG CAACTTTATT CATTATTGCA ATTGCGGTAC ATCCTTCTAG 
TTTATTTGCT AAATTTTTAA GTATGAAACC TTTACTAATT ATAGGTAAAC GATCATATAG 
CTTATACTTA TGGCATTATC CTATCATTGT TTTTGTGAAC AGTTATTACG TACAAGGACA 
AATACCGGTA TACGTTTATA TTATAGAAAT TTTGTTAACA GCGTT7VATGG CTGAAATTTC 
GTATCGCTTT ATTGAAACAC CTATACGTAA AAAAGGATTT AAAGCTTTTG CATTTTTACC 
TAAAAAGAAG GGGCAATTTG CTAGAACAGT GTTAGTTATC CTATTATTGG TTCCGTCTAT 
CGTTGTGCTC AGTGGACAGT TTGATGCACT TGGCAAACAA CATGAAGCCG AGAAGAAAGA 
GAAGAAGACG GAATTTAAAA CAACGAAGAA AAAAGTCGTT AAAAAAGATA AGCAAGAGGA 
TAAGCAGACA GCGAATAGCA AAGAGGATAT TAAAAAGTCA TCACCACTAT TAATTGGTGA 
CTCGGTCATG GTGGATATTG GTAATGTCTT TACTAAGAAA ATACCAAATG CACAAATTGA 
TGGTAAAGTT GGACGGCAAC TCGTTGATGC TACACCAATT GTGAAATCGC AATATAAAGA 
CTATGCTAAA AAAGGTCAAA AAGTTGTAGT AGAGCTTGGT ACAAATGGGG CATTTACGAA 
AGATCAATTA AATGAACTAT TGGATAGTTT TGGAAAAGCA GACATATATT TAGTTTCTAT 
TAGAGTACCT AGAGATTATG AAGGTAGAAT AAATAAATTA ATTTATGAGG CAGCTGAAAA 
GCGCTCTAAT GTACATCTAG TCGATTGGTA TAAAGCTTCT GCAGGTCATC CGGAATACTT 
TGCATATGAC GGTATTCACT TAGAATATGC AGGTAGTAAA GCGCTGACTG ATTTGATTGT 
AAAAACGATG GAAACACATG CTACAAATAA GAAATAATTT GATGCACTAA ACTTTTGAAA 
TATTACATTA CTTCTGATAT TTATTATCAA AAATGATGTA TTTCATTAAA AGTTTAGTGC 
TTTTTTATTT TCAAATCCCA TAGTAACGGT GCAGAAAAAG TGTTGTAAAC ATTCTAATTG 
GTATATTACA TTCAATGAAG CTTTATTAGG AACAGATTAC ATTATGATAA CAAAGCCCGC 
AAGACACCTA ATCTCTGTTA TAGTTTGTTT TGTCGCAAAA CTATAAAAGT TATAATTGTT 
TGCATACTAA AAAAATAAAA AATATAAAAT TTAAAATAAT TGAGTCGCTA ATGACTATAT 
GTATAGAGTG TTTTGATTAT TGGGAGGATA TTTAATTATG AAAAAAATCG TTACAGCTAC 
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TAAGTATAAT GAATAATATT AGAATTCATG CACTAGTTTA TTAAAATAAA GAGTAATTTA 5100 

AAATATCATT CCGTGTATTA AAGTGAATGG AAATGATTAG TTATTATTTT TAACAGTATC 516 0 

TTTTTGTTCA ATAGCTTCTA ACATTAATTT AGTCATGCTC GCTAAATCAT ATTTAGGATC 5220 

AAATCCCCAT TCGCCACGTG AACAGCTTGT ATCAATAGAA TCCGGCCAAC TATTAGCGAT 5280 

ACCTTGTCTA ATAGGATCAA CATCGTAATC TAATGTAAAA TTGGGATAGT ATTCTTGAAT 534 0 

TGCTTCTTTT ACCATCTCTG GATCAAAACT CATTGCGCTC AAATTATAAC CATTTCTAGT 540 0 

TTCTAATTTA GCGTCGTCTG CTTCCATAAG TTTAATAATT GCTTCAATTG CATCATCCAT 5460 

ATACATCATA TCCATATACG TGCCTTTATC TATGAAGCTT GTATAATGAC CCTCTCTTAC 552 0 

TGCTTTGAAG TATATTTCAA CAGCATAGTC TGTAGTACCG CCACCTGGCT CTTTAACATG 5580 

CGAGATTAAA CCTGGGAATC TAACACTTCT TGTATCTACA CCAAAACGTT TGAAATAGTA 564 0 

TTGACACAAT AATTCTCCAG CTACTTTATT TACACCATAC ATTGTCGTAG GTTGCTGAAT 570 0 

CGTTACTTGT GGCGTATTAA CTTTAGGAGT TGAGTCTCCA AATGCACCAA TTGAACTTGG 5760 

TGTGAAAAAG TGCAAATTAT AAGTTCTTGC AGCTTCTAAT GCATTCATTA ATCCACCCAT 5820 

2S ATTTAAATCC CAAGCTAGAA TTGGATTTTT CTCAGCAGTT GCTGATAATA ATGCTGCCAT 5880 

ATGCATTAGA CTATCCGCTT CAAAGTCCCT AACTAACTCA AACATACGGT CZACGATCTGT 594 0 

TACGTCTAAG ATTTCAAATG GTCCATTTTG TACAGGTGAG TCTGCTTCAG GTTCCCTAAT 6000 

ATCTGTAGCA AGAACATTAT CTGTCCCATA AATTTCTCTG CACTTAACAA CTAATTCTGT 6060 

ACCAATTTGT CCTAATGCAC CAGTAATCAT AATTTTTTTC ATAGAAATAT CTCCTTTGtC 6120 

TCTTCTATAT AGCTATAGTC CATCACAAGC GGaCATAATA TTCATTTTCA TAATAATTAT 6180 

AATATAAAAG CGCTTTCTTG TATATATGAC ATGTACATGT TGCTGATATk TCTGTAAATG 6240 

GAAATTCTAG TTGTATTAAT TGATTTTAGT AATTTATAGC GTTTATTATT GCTAATTACT 6300 

GtCAAATTAA ATTTTTTATC CCTCAACTCT TAAACTCTGG ATATCTTTCA TTATATTAGC 6360 

TTTTTTATAA CCATGGATAT CATGTAAAGC CTTATAAGCn TTAAATAATG TTTCATACCT 6420 

TTGTACTTnT TCCGCTTCTG GATT 6444 
45 (2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GTATCTTGTC CCGTCACATC ATTAAATGCC TGGATAATTT GTTGTAAAGA TCCTGAACCC 3300 

ATCACTGATT TTGGATCAAT AGATTCTTTA ATGACACCGA GTTGTCTTAA AGCAGCTTCT 3360 

TTATTATGTA CTGGGAAAGT CTCAAGCAAT GATTGTACAA ATCGTACCCC TATTTTTTCC 3420 

CAACTTCTAT CAAATTCAAT TAACGTACCA TCTTTATCAA ATAATATCCA TTCCATTGaT 34 80 

ATCAATACTC CTATTTATTT ATTTCGTATT ATGCTGATTC TATGATATTC GTTATCCCCT 3 54 0 

GAAAATGAAC TCGTAGTATT GTTCTATTTA AATATTGaAT TAAATATAAT AATAAGTGAA 3600 

ATCCCCTTCA ATACTTAACA ATAAACATTG TAAACTTAAT TTATTACCAT GCTTCGCTTC 3 660 

75 ATTGAAAGGG ATTTTAGTCA TGATTAACTT TTGCATATTG TTTTCATGAT TATATTCAAT 3720 

TTTTATTAAT ATTTTGGTAC AACGACTCTC CAACCATTTT TATCTTCTAA AGTACCATTT 3780 

TGAATACCAG TATAGACGTC GTATAATTTT TGAGTAATTT CACCAGTCTC ATTATTATTA 3 84 0 

^» * w *. jt w w A n A x« A. v-Hf-wn X w a r-»>— r-v\J vj a vj^-XTIJ-l X rtJ-lV, J. VJV_ XAJV^-J^VJ i il J I? LI U 

CCACTACCAA ATACTTCTGT TAACTCACCT TTATCATATG ATTCGAATAA TTCATCGATT 3 960 

GAAACGCGGC GCTCTTCGAC TTCATATCCT AAG TT TTTAG CTAATTCGAT AATAGATTTA 4 020 

CGTGTAATAC CAGGTAAAAT ACTGCCATTC AACTCTGGTG TAATTACTTT GCCATTTTCA 4 080 

ACGAAGAAAA TGTTCATGCT ACCAACTTCT TCGATATATT TCTGTTCAAC ACCATCAAGC 414 0 

CATAATACTT GGTCATAACC TAATTTATTT GCATTAGTTT GTGCTAATAA ACTTGcCGCA 4200 

TAGTTACCTG CAACTTTTGC AAAGCCTACA CCGCCACGaA CAGCACGCAC ATATTCATCT 426 0 

TCTACATAGA TTTTAGTTGG TTTTAAAGTT TCT^CCACCAT AATATGCACC TGAAGGAGAT 432 0 

35 AAAATAATTA ATAATTTATA CTGATGTGAT GCACCAACGC CAAGTGCCCC TTCTGTTGCA 43 80 

AAAACAAATG GACGAATATA TAATGATTGA CCTTCCCCTT CAGGAATCCA ATCTCTTTCA 444 0 

ATAreAACTA ATTGTTTTAG CCCCTCTAAC AATTCTGCTT CGTCTACTTG AGGCATTTCT 4 500 

AATCGTGCTA ACGAGTTATT AAGACGCTTA AAATTTTCTT CAGGACGGAA AAGTGCAACT 4 560 

TCCCCATCTC TTTTATATGC TTTTAATCCT TCGAATACCG ATTGACCATA ATGAACACCT 4 620 

TGTGCAGCAG GTG7VAATTTC AATAGGACCA TAAGGTACTA TCTTCAAATC ATGCCATCCT 4 680 

TTATCTGCAT CATAATCATA ACTCAACATA TAATCAGTAA AATATTTACC AAAACCTAGT 474 0 

TGAGATGTAT TTGGTTTTTG TTTTAATGTT TCTCGTCGTT CAACTTTAAC TGCTTGTGAC 4 800 

ATGGTGATTG CCTCCTAATA ATATTGTATA AGAATTTGTT TAACTTAAAT TATAACAATC 4 860 

CaTATTTTGC TGTTCAACAA ATTTTCTAAA AATTCAAAAT TAATTAACAG ATTTCTAGAA 4 920 

AGACTATATC TTTTAGTATA AACX3TATTAA TTTCACAGAG ACAAGTAATC TGTGTTTTAC -6980 
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CTATTGGCGT AGGTAAATCT TCACTTGCAC ACAAATTAAG TCAAACTTTA GATTTTTATG 1500 

AAGAAAAAGA AATCATCACA GAAAATCCAT TTTTATCAGA CTTTTATGAA GATATCTCTA 1560 

AATGGAGTTT TCAAACTGAA ATGTTCTTTT TATGCAATAG ATATAAGCAA TTTCAAGATG 1620 

TAACACAACT AAATCAAGGT GTAGTTAGTG ATTATCATAT ACATAAAAAT AAGATATTTG 1680 

CTAAAAATAC TTTGAGTTCT GTTGAATTTC AGAAATTCAG TAAAATTTAT GATATTTTAA 174 0 

CTGAAGATAT GATTATGCCG AATATGATTA TCTTTTTAGA TGCAGACCTT GATGTGTTAA 1800 

AATCTAGAAT TGCTAAACGT AACCGTAGTT TTGAGCATCA AATAGT^GtG AtAcTGTaAg I860 

TTAAAAAAAG ATTATCGTGA GTATTATGAG TCCTTACAAA GTAATGGTTC AAATGTAGTT 1920 

TTAATCGATA CnACTTCTAT TGATTTTCTT AAAAATGAAC AAGATTACX3A AGATATATTA 198 0 

CATATTATAT TACCTATGAT AGGAGATATT ACCAATGAAT AATTACGGTA TTCCACAAAA 2040 

20 TGCCATTATA ACCATTGCAG GTACAGTTGG TGTTGGAAAA TCAACACTAA sGCAAGCACT 2100 

TGCAGATAAA TTAAACTTTA AAACGTCTTT TGAAAATGTC GAACATAATC CATATTTAGA 2160 

TAAATTTTAC AGCGATTTTG AACGATGGAG TTTCCATTTG CAAATTTACT TCTTAGCTGA 2220 

ACGTTTTAAA GAACAAAAGC GTATGTTTGA ATATGGTGGT GGCTTTGTCC AAGATCGATC 2280 

AATTTATGAA GATGTTGATA TTTTTGCAAA AATGCATGAA GAAGAAGGCA CAATGAGTAA 2340 

AGAAGATTTC AAAACATATT CAGACTTATT TAATGCCATG GTCATGACAC CTTATTTTCC 2400 

TAAACCTGAT GTAATGATTT ATTTAgAATG TAACTATGAT GAGGTCATTG ATCGTATTAT 24 60 

TGAACGTGGT CGCGAAATGG AAATTAATAC AGACCCTGAA TACTGGAAAA AGCTATTTAA 2520 

ACGCTATGAC GATTGGATTA ATAGCTTTAA TGCATGTCCA GTTGTACGTA TCAATATTAA 2580 

TGAATATGAT ATCCATAAGG ACCCCGAATC TTTAAATCCT ATGATAAACA AAATTGCTCG 264 0 

AATTATTCAA ACATATCGAC AAGTAGATAC ACGATAAAAG ACTAAAGACA TAGCGTATAT 2700 

40 GTTTATATTC AATGTATATT CCATAGATAT TATOGATTAT TTTATCAATT CTATCGAATA 2760 

CATTAATTCA CATATACACT ATGTCTTTCT TTTTAATTTA AAGCTTCTAA AATATCTGCC 2 820 

GCACTATTTA AAATAATATC AGCTTCATGT AATTCTTCTT TTGTTGCAAT ACCTGTTAAT 2880 

ACACCTATTG CCATACCTAA ATTTGCATTA CTTGCTGTCT TCATATCATT AGCAGTGTCT 294 0 

CCTACTATAG CTACTTTCTG AGGATCTACA TTATATTGCT CAAATAAAGG CGATAATACT 3000 

TTAGGATTTG GCTTCTCATA GGCATCCGCT TCGGTAGAAA TGATCAAATC GAACAACGAG 3 060 

GTAGCATTGG TATGTGCTAA AAATTGTTCT ACACCTTTTT TAGTATCACT CGTAACAATA 3120 

CCAAGTTGAT AGCCTTTTGC TTTCAAATCG ATAAGTGCTT CTTTAACACC TTCTACCCAA 3180 
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(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6444 base pairs 

(B) TYPEr nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


233 : 








TGATAAGTCA TTTAAATTGT 


CACCTATTGA 


CATGACTTCT 


TTCATTTCAA 


TCCCTAATCT 


60 


IS 


TTCGGCAATT 


GTTTCTAGCG 


CAATACCTTT 


TTGTGCATCT 


GAATGCGTTA 


TTTCTATATT 


120 




TCCTCTCGAA 


GATGATGATA 


TAGCTAAATT 


CGGAGAkTCA 


GCTAAAATTT 


TACTAGCTTT 


180 


20 


GTCAAI-rriT 


TCTAAATTTC 


CATCAAATGC 


TAATATTTTC 


ATAATTAATT 


CACCAGGTAT 


240 




GCATCATAAT 


TATCAACAAC 


TyTCAACGTA 


CCATTATCTA 


TGCGTCTTTG 


300 




AATACCATTT 


TTAATACGCT 


CAACGTTTGC 


ATGTTGACCT 


GCACGCTCAG 


CAATATCTAT 


360 


25 


GTAAATGTCT 


AAATCTCTTT 


GTGGATCTTC 


AGTATAAATC 


GCACGACTCG 


TGTATACTTG 


420 


ATAATAAATA 


CCTGCATCTT 


TTAAAACATT 


TGTAATTTTG 


TGTACTAACG 


ATTTATTAAG 


480 




GTGTGAAGTG 


CTCATTACAT 


TGAAAGTTTC 


ATCACGTACT 


TCAGCACCAT 


TCAAACAAAT 


540 


30 


ATATGGTACT 


GTTAAATCTG 


TGTCAGCAAC 


TGGTGCTTGk 


GCTTCATAAA 


ATGCTCGACC 


600 




TGTCGCGATA 


ACAACCGTTA 


TCCCTTGTTC 


TTGAGCGTAT 


TTAATCGCAT 


CAATATTAGG 


660 




TTGAGAAATT 


TCATGTGCTG 


CATTAAGTAG 


CGTGCCATCC 


ATATCAGTGG 


CTATTAGTTT 


720 


35 


TATCATTATG 


TnACCTCGTT 


TCGTAAATnT 


AAAATCTTGT 


TCTTAAATAA 


GrATATATAC 


780 




TCAGCGCACA 


TACTTTtCTA 


TTAmCATTTA 


TATkGTCATT 


aATTTATCAT 


ATAATGTAAT 


840 




TCTaACAAAT 


nTTAAtTAGT 


ATGTACTATC 


GTCTAATTGG 


TGGATTTCTT 


ATTGGCTCTT 


900 


40 


AAgTTTTTAA AAAATGTTGT 


TAATAATGTG 


CTACATGCTT 


CTTTAAGTAC 


ACCTTTATCA 


960 




ACAATTGCAC 


GATGATTAAA 


ATTAGATTGT 


TGCAATAAAT 


TCATTAAACT 


GCcACTACAA 


1020 


45 


CCACCTTTAG 


GATCATCTGC 


GCCATAGACG 


ACTCTTGGAA 


TGCGACTCAT 


TACAATTGTT 


1080 


CCTGCGCACA 


TGACACATGG 


TTCTAAGGTT 


ACATATAATG 


TGCAACCTTC 


TAAACGCCAA 


1140 




CTACCTAACA 


CTTTGGCTGC 


ACGTTCAATT 


GCAATATGTT 


CAGCATGCGC 


CGTTGGTTGT 


1200 


SO 


TGTAGTGTTT 


CTCTTaAATT 


ATGTGCTCTA 


GCGATAACTT 


CATCATCTTT 


AGTGATGATA 


1260 




GCACCTATAG 


GTACTTCGCC 


TAGTTGAGCT 


GCTTI-riTAG 


CTTCTTCAAT 


CGCTAATGTC 


1320 




ATAAAATATA 


TATCATTTGT 


CATTTATGTC 


CAGATACCTC 


ACTTATGGTA 


CAATACTCAA 


1380 
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ATCTATAGCT GCATATTGAA CAACATCCTC GATATGCX3AT AAATCACGTT TTTGTGTATG 3060 

ATGAATATAA TCTAGCAATA ATTGTGTCGC TTGATACaTT AATTTATGTT CAGTTTGATT 3120 

^ CACACTATAG ATTtCTGATG ATAACGTTTC CCTGACTGT 3159 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 12 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

IS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

TAAGCGAGAA GCAATTGGTG TTATGTATGC TAGTGATAAA CCAACAGGTG AAAGTACAAG 60 

20 GTCATTTGCT GTTTATTTCT CTCCTGAAAT TAAGAAATTT ATTGCAGATA ATTTAGATAA 120 

ATAAATCATC CATCCATACA TTGATAAATG ATTTTyAGAA ATTAACAACA AAATCAACAA 180 

TTTTAAACAT CTCTGTGATT CTATTTATTC GAAATGATTT AAAAAATAAA ACTTCAAAAA 24 0 

CCTAACCTTA TATTTATACG AATACTTAGA GGAGCACAAA AATGAATAAA AATATAATCA 300 

TCAAAAGTAT TGCAGCATTG ACGATTTTAA CATCAGTGAC TGGCGTCGGC ACAACAGTGG 360 

TTGAGGGTAT TCAACAAACG GCTAAAGCTG AACATAATGT GAAACTAATC AAAAATACTA 420 

ATGTAGCACC ATACAATGGT GTCGTTTCGA TAGGATCTGG AACAGGTTTC ATTGTCGGTA 4 80 

AAAATACAAT TGTTACCAAC AAGCATGTCG TTGCAGGTAT GGAAATTGGT GCACATATTA 54 0 

TAGCGCATCC CAATGGTGAA TATAATAATG GCGGATTTTA TAAAGTTAAA AAAATTGTCC 600 

GTTATTCAGG TCAAGAAGAT ATTGCCATTC TACATGTGGA AGATAAAGCT GTTCATCCAA 660 

AAAACAGGAA TTTTAAAGAT TACACAGGCA TTTTAAAAAT AGCATCAGAA GCTAAAGAAA 720 

40 ATGAACGCAT TTCAATTGTT GGCTATCCAG AACCATATAT AAATAAATTT CAAATGTATG 780 

AGTCAACAGG AAAAGTGCTG TCAGTTAAAG GCAACATGAT TATTACTGAT GCTTTCGTAG 84 0 

AACCAGGCAA CTCAGGTTCA GCTGTATTTA ACAGTAAATA CGAaGTtGTA GGTGTTCACT 900 

TTGGTGGAAA CGGCCCTGGA AATAAAAGTA CAAAAGGATA TGGTGTTTAT TTCTCTCCTG 96 0 

AAATTAAGAA ATTCATTGCA GATAACACAG ATAAATAAAT CCTTACATAG ATAAATGATT 1020 

TTAAAAATTA ACAACAAACT CAACaATTCA AATCATCTCT GTGATTCCAT TTATTCGAAA 1O80 

TGATTAAAAA T^AATAAAACT TCAAAAAGCT AACATTATAA TTATACAAAT ACTTAGAGGA 114 0 

GCAGAAAAAT GaATA/U^AAT ATAATCATCA AAAGTATTGC AGCATTGGAC G A TTTTTAAC 1200 
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TTGTAGTTCA ATCTCGCTTT TTTGATCATT TTCAAACAAA TCAAATGATG CyTGTTCAAA 1260 

GTCTTTTTGA GATAAAGTAT CaGTTGTTTC TTCaACACTT aAGTTTAAAT TTTCTTGATT 13 20 

AATTTCCAGG TTCATTTTCG ACCATTTTTA AATTTGATAT CGATGATtTT TCACCAGCAG 13 80 

ACGCTTCAAA CTCGCTTAGA ATCACTTGTG CTCTGCTAAT AACTTTTTCA GGTAAATCAG 1440 

CTAATTTCGC AACTTGAATA CCATAAtATC GTCAACTGCA CCATCTTTGA CTTTATGCAA 1500 

GAATATAAGT TCACCTTTAT ATTCATTAGC AGCGACGTGA ACATTTTTTA GACTTGGTAA 1560 

TGCTTGATCT AATGTTGTCA ATTCATGATA ATGTGTTGAA AATAACGTTT TAGCATGTGA 1620 

TGTTTCAGCT ACATACTCTA TCATTGCCTG CGCTAAAGCT AAACCGTCAT ACGTTGAAGT 1680 

ACCACGTCCA ATTTCATCGA AAATAATCAA ACTATCCTCT GTTGCATAAG TTAATGCCTT 174 0 

TTGTGCTTCT AGCATTTCTA CCATAAACGT ACTCTTACCT GAAACCAAAT CATCTGCCGC 1800 

20 ACCTATTCTA GTGAATATTT GATCAAATAT AGGTAACACT GCCTCTTTAC AAGGGACATA 1860 

AGCTCCCATT TGGGCCATTA TACTAATTAT GGCAACTTGT CTCATATATG TCGATTTACC 1920 

AGACATATTC GGACCTGTAA TTAAATATAT AAATGTTTCA TTATCTAATC GACAATTATT 1980 

AGGCACATAG TCATTATAAT CCATTACTCT TTCCACTACT GGGTGCCTAG ATTCCACTAA 204 0 

TTCTAATGTT TTATTTTCAC TAAATGAAGG CCTAGTGTAA TTATATTTTT GAGCAATTTC 2100 

TGCAAAGCTC TGTAAACAAT CTAGCTCTGA AATAATTTTA GCTTGTTGTT GTAAACGTTC 2160 

AGTATATTTT TTAACTTCTT CACGTAGCTG AACAAATAAT TGATATTCTA ATTCGATGGC 222 0 

TTTGTCTTCC GCACCTAAAA TGATATCTTC TTTTTCTTTA AGTTCATCAG TTATAAAACG 2280 

TTCAGCATTC GATAACGTTT GCTTCCTCAT ATAACCAAAT TCACTTGGTT CAAAATTTTG "234 0 

CAAGTTGGCA CGTGTTATTT CTATAAAATA ACCAAACACT TTATTAAAGC TTATTTTCAA 24 00 

TGATTTTATT CCTGTACGTT GTCTTTCTTT GGCTTGTAAT TCTGCTAACC ATGTTTTTCC 24 60 

40 GTTTTTTGAA GCTTCAAGAT ATTCATCTAA TTGCX5TATTA AAACCAACTT TGAATAGTCC 2520 

GCCATCTTTA ACTGAAATTG GTGGTTCTTC TACTAAACTC TGTTCTAATA TATCAAGTAA 2580 

ATCATCaAGG GGTTCTAGTT GATTAACTTG TACAAGAGTA TTCTGATTCA TAGAATTTAG 264 0 

TAATGCTTTA ATATTCGGTA TTTCAGAAAT GGAATGTTTA AGTTGAATTA AATCTCTCGC 2700 

ATTAACATTT CCGTAACTAA CACGCCCAAC AAGACGTTCA ATATCATACA CTTGATTAAG 2760 

ATATGTTCTT AAGGTGTCTC TTTCTATGAA ATGAGCACTA AATTCATCAA CGATATCTT^ 2820 

TCGTGCTTCA ATTTGTTCTT TACTTATTAG TGGTCTATCT ATCCATTGTT TTAAGCGGCG 2880 

TGCTCCCATT GGTGTTTTCG TTTCGTCCAT TAGCCAAAGT AGCGTTCCTT TTTTTGATTT 2940 
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CTTGGTTTGA TTTTAGGCAA GGTAATGGTT AATAACCCAT TTTCAAAACT AGCAGTAATA 1920 
TGTTGCTTAT CAACAGCTTC AAAATCAAAT TGACGCATTA ATGATTCGAA GTTACGCTCA 1980 

5 

TCTAAAATGA GTTGTTCAGA TTTGTATTTT GCGCTTCTAG TAGCTTGAAT AGTGAGCGwA 204 0 

TTAttATTGA AATCgATACT AATAtCTccC TG 2 072 

(2) INFORMATION FOR SEQ ID NO: 231: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C> STRANDEDNESS : double 
,^ (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
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CGTCTTCTCT 


TGGTTATTCT 


CTGTGTTTTG 


TCTTTGTTCA 


AATTCGATTT 


TTTGTTGTTC 


60 


GAATGAATGT 


AATACTTTAT 


i-rrrrriTGG 


CACATAATCC 


AAGTnATTTT 


TAGGAATTAA 


120 


TATACGGTCT 


TTAAATGCTT 


CTTGTATTTT 


GCTCACAATC 


AATTGGTATA 


GTTGCTCTTC 


180 


TTTTGATAAA 


CGCACTTCTA 


GTTTTGTTGG 


ATGAACATTT 


ACGTCTACTA 


AGATTGGATC 


240 


CATTTCAATA 


TTAATATAAC 


AAATCGGGAA 


CCTACCTATT 


GTTAAGAGTG 


TATGATAGCC 


300 


TTCTAAAATC 


GCTTTATTTA 


GCATAAAGTT 


TTTAATGTAT 


CGTCCATTAA 


TAAAAATAGA 


360 


AATATAGTGC 


TTATTACTTC 


TAGAATGTTC 


AGGCTTTGCA 


ACAAAACCTT 


CAATGTGATA 


420 


ATCACTTGTA 


TCTCCAGATA 


TATGTACTAA 


ATCTCGTGCA 


ACTTTCATCC 


CATAAATCTC 


480 


TGCCATCACT 


TCATTAGTTC 


GTCCTGAACC 


ATTTGTACTT 


AACATTGTTT 


TGCCATCTGA 


540 


AATGAGTGCT 


ATTCGAATGT 


CCGGATGGCT 


CATTGCCATT 


CTGTTGACAA 


TATCTGTTAT 


€00 


TTTACCTAGT 


TCAGTGTATA 


AACTTTTAAT 


ATATTTTAAA 


CGTGCTGGTG 


TATTATAAAA 


€60 


TAATGATTCT 


ACAAGTATAT 


CTGTTCCTTT 


TTTCGCTTTT 


GCAGGCTTAT 


GATTTAATAT 


720 


TTCACCATTT 


TCTACATATA 


TTTCATTTCC 


ATTAGCATTA 


TCCGTGCAAG 


TCTTCAATGT 


780 


TACTTTAGCA 


ACTGATGAAA 


TACTGGCTAG 


TGCTTCACCA 


CGGAATCCTA 


ATGTCCTAAT 


84 0 


ATGAAATAAA 


TCTTCATCTT 


GATCTAATTT 


ACTAGTCGCA 


TGTCTATGAA 


ATACTAATCC 


900 


TAAGTCTTCC 


GCTTCAATTC 


CGCTTCCATT 


ATCGACTACG 


CGAATAGATT 


GGACGCCAGA 


960 


tTCCTCTACT 


TCAATGCTTA 


TTTCTGTAGC 


GCCTGCATCT 


ATAGCATTTT 


CCAACAGTTC 


1020 


TTTCACAACA 


GAACTCGGTC 


TTTCAACTAC 


TTCACCTGCT 


GCGATTTTAT 


TTGCTAATGA 


1080 


GGTTTGGAGT 


TCTTTAATTT 


TCCCCATTTT 


GCAACACCTC 


TATTTTAATT 


GATTTTGTAA 


1140 
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